Abstract. We address the problem of Conjunctive Query Answering (CQA) for the description logic $\mathcal{DL}\langle \mathsf{4LQS^{R,\times}}\rangle(\mathbf{D})$ ($\mathcal{DL}_{\mathbf{D}}^{4,\times}$, for short), which extends the logic $\mathcal{DL}\langle \mathsf{4LQS^{R}}\rangle(\mathbf{D})$ with Boolean operations on concrete roles and with the product of concepts.

The result is obtained by formalizing $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-knowledge bases and $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-conjunctive queries in terms of formulae of the four-level set-theoretic fragment $\mathsf{4LQS^R}$, which admits a restricted form of quantification on variables of the first three levels and on pair terms. We solve the CQA problem for $\mathcal{DL}_{\mathbf{D}}^{4,\times}$ through a decision procedure for the satisfiability problem of $\mathsf{4LQS^R}$. We further define a KE-tableau based procedure for the same problem, more suitable for implementation purposes, and analyze its computational complexity.


1 Introduction 

In the last few years, results from Computable Set Theory have been used as a means to represent and reason about description logics and rule languages for the semantic web. For instance, in [6,8,9], fragments of set theory with constructs related to multi-valued maps have been studied and applied to the realm of knowledge representation. In [10], an expressive description logic, called $\mathcal{DL}\langle \mathsf{MLSS}^{\times}_{2,m}\rangle$, has been introduced and the consistency problem for $\mathcal{DL}\langle \mathsf{MLSS}^{\times}_{2,m}\rangle$-knowledge bases has been proved NP-complete. The description logic $\mathcal{DL}\langle \mathsf{MLSS}^{\times}_{2,m}\rangle$ has been extended with additional constructs and SWRL rules in [9], proving that the decision problem for the resulting logic, called $\mathcal{DL}\langle \forall^{\pi}_{0,2}\rangle$, is still NP-complete under suitable conditions. The description logic $\mathcal{DL}\langle \forall^{\pi}_{0,2}\rangle$ has been extended with some metamodelling features in [6]. In [7], the description logic $\mathcal{DL}\langle \mathsf{4LQS^R}\rangle(\mathbf{D})$ (more simply referred to as $\mathcal{DL}_{\mathbf{D}}^{4}$) has been introduced. $\mathcal{DL}_{\mathbf{D}}^{4}$ can be represented in the decidable four-level stratified fragment of set theory $\mathsf{4LQS^R}$, involving a restricted form of quantification over variables of the first three levels and pair terms (cf. [4]). The logic $\mathcal{DL}_{\mathbf{D}}^{4}$ admits concept constructs such as full negation, union and intersection of concepts, concept domain and range, and existential quantification and min cardinality on the left-hand side of inclusion axioms. It also supports role constructs such as role chains on the left-hand side of inclusion axioms, union, intersection, and complement of abstract roles, and properties on roles such as transitivity, symmetry, reflexivity, and irreflexivity. It admits datatypes, a simple form of concrete domains that are relevant in real-world applications. The consistency problem for $\mathcal{DL}_{\mathbf{D}}^{4}$-knowledge bases has been proved decidable in [7] by means of a reduction to the satisfiability problem for $\mathsf{4LQS^R}$, proved decidable in [4]. It has also been proved, under constraints that are not very restrictive, that the consistency problem for $\mathcal{DL}_{\mathbf{D}}^{4}$-knowledge bases is NP-complete. Finally, we mention that the papers [6-10] are concerned with traditional research issues for description logics, mainly focused on the parts of a knowledge base representing conceptual information, namely the TBox and the RBox, where the principal reasoning services are subsumption and satisfiability.

In this paper we exploit decidability results presented in [4, 7] to deal with reasoning services for knowledge bases involving ABoxes. The most basic service to query the instance data is instance retrieval, i.e., the task of retrieving all individuals that instantiate a class C, and, dually, all named classes C that an individual belongs to. In particular, a powerful way to query ABoxes is the Conjunctive Query Answering task (CQA). CQA is relevant in the context of description logics and, in particular, for real-world applications based on semantic web technologies, since it provides a mechanism allowing users and applications to interact with ontologies and data. The task of CQA has been studied for several well-known description logics (cf. [1-3,13-18,20-23]). We introduce the description logic $\mathcal{DL}\langle \mathsf{4LQS^{R,\times}}\rangle(\mathbf{D})$ ($\mathcal{DL}_{\mathbf{D}}^{4,\times}$, for short), extending $\mathcal{DL}_{\mathbf{D}}^{4}$ with Boolean operations on concrete roles and with the product of concepts. Then we define the CQA problem for $\mathcal{DL}_{\mathbf{D}}^{4,\times}$ and prove its decidability via a reduction to the CQA problem for $\mathsf{4LQS^R}$, whose decidability follows from that of the satisfiability problem for $\mathsf{4LQS^R}$ (proved in [4]). Finally, we present a KE-tableau based procedure that, given a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-query $Q$ and a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-knowledge base $\mathcal{KB}$ represented in set-theoretic terms, determines the answer set of $Q$ with respect to $\mathcal{KB}$, and we provide some complexity results. The choice of the KE-tableau system [11] is motivated by the fact that this variant of the tableau method allows one to construct trees whose distinct branches define mutually exclusive situations, thus avoiding the proliferation of redundant branches typical of semantic tableaux.

2 Preliminaries 

2.1 The set-theoretic fragment $\mathsf{4LQS^R}$

It is convenient to first introduce the syntax and semantics of a more general four-level quantified language, denoted 4LQS. Then we provide some restrictions on quantified formulae of 4LQS that characterize $\mathsf{4LQS^R}$. We recall that the satisfiability problem for $\mathsf{4LQS^R}$ has been proved decidable in [4].

4LQS involves four collections, $\mathcal{V}_i$, of variables of sort $i$, for $i = 0, 1, 2, 3$. Variables of sort $i$, for $i = 0, 1, 2, 3$, will be denoted by $X^i, Y^i, Z^i, \ldots$ (in particular, variables of sort 0 will also be denoted by $x, y, z, \ldots$). In addition to variables, 4LQS also involves pair terms of the form $\langle x, y\rangle$, with $x, y \in \mathcal{V}_0$.

4LQS-quantifier-free atomic formulae are classified as:

- level 0: $x = y$, $x \in X^1$, $\langle x, y\rangle = X^2$, $\langle x, y\rangle \in X^3$;

- level 1: $X^1 = Y^1$, $X^1 \in X^2$;

- level 2: $X^2 = Y^2$, $X^2 \in X^3$.

4LQS purely universal formulae are classified as:

- level 1: $(\forall z_1) \ldots (\forall z_n)\varphi_0$, where $z_1, \ldots, z_n \in \mathcal{V}_0$ and $\varphi_0$ is any propositional combination of quantifier-free atomic formulae of level 0;

- level 2: $(\forall Z^1_1) \ldots (\forall Z^1_m)\varphi_1$, where $Z^1_1, \ldots, Z^1_m \in \mathcal{V}_1$ and $\varphi_1$ is any propositional combination of quantifier-free atomic formulae of levels 0 and 1, and of purely universal formulae of level 1;

- level 3: $(\forall Z^2_1) \ldots (\forall Z^2_p)\varphi_2$, where $Z^2_1, \ldots, Z^2_p \in \mathcal{V}_2$ and $\varphi_2$ is any propositional combination of quantifier-free atomic formulae and of purely universal formulae of levels 1 and 2.

4LQS-formulae are all the propositional combinations of quantifier-free atomic formulae of levels 0, 1, 2 and of purely universal formulae of levels 1, 2, 3.

Let $\varphi$ be a 4LQS-formula. Without loss of generality, we can assume that $\varphi$ contains only $\neg$, $\wedge$, $\vee$ as propositional connectives. Further, let $S_\varphi$ be the syntax tree for a 4LQS-formula $\varphi$,$^1$ and let $\nu$ be a node of $S_\varphi$. We say that a 4LQS-formula $\psi$ occurs within $\varphi$ at position $\nu$ if the subtree of $S_\varphi$ rooted at $\nu$ is identical to $S_\psi$. In this case we refer to $\nu$ as an occurrence of $\psi$ in $\varphi$ and to the path from the root of $S_\varphi$ to $\nu$ as its occurrence path. An occurrence of $\psi$ within $\varphi$ is positive if its occurrence path deprived of its last node contains an even number of nodes labelled by a 4LQS-formula of type $\neg\chi$. Otherwise, the occurrence is said to be negative.
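The notion of positive and negative occurrence can be illustrated with a small sketch. It is restricted, as a simplifying assumption, to propositional combinations (quantifier prefixes are not modelled), and the tuple encoding of formulae and the name `polarities` are illustrative choices that do not come from the paper.

```python
def polarities(formula, negations_above=0):
    """Yield (subformula, is_positive) for every occurrence in `formula`.

    Formulae are nested tuples: ("atom", name), ("not", f),
    ("and", f, g), ("or", f, g). An occurrence is positive when the
    number of negation nodes strictly above it is even.
    """
    yield formula, negations_above % 2 == 0
    op = formula[0]
    if op == "not":
        # The negation node itself counts for everything below it.
        yield from polarities(formula[1], negations_above + 1)
    elif op in ("and", "or"):
        for sub in formula[1:]:
            yield from polarities(sub, negations_above)


p, q = ("atom", "p"), ("atom", "q")
example = ("or", ("not", p), q)
for sub, positive in polarities(example):
    print(sub, "positive" if positive else "negative")
```

In the example, the occurrence of `p` is negative because exactly one negation node lies on its occurrence path once the last node is removed, while `q` and the negated subformula itself occur positively.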

The variables $z_1, \ldots, z_n$ are said to occur quantified in $(\forall z_1) \ldots (\forall z_n)\varphi_0$. Likewise, $Z^1_1, \ldots, Z^1_m$ and $Z^2_1, \ldots, Z^2_p$ occur quantified in $(\forall Z^1_1) \ldots (\forall Z^1_m)\varphi_1$ and in $(\forall Z^2_1) \ldots (\forall Z^2_p)\varphi_2$, respectively. A variable occurs free in a 4LQS-formula $\varphi$ if it does not occur quantified in any subformula of $\varphi$. For $i = 0, 1, 2, 3$, we denote by $\mathrm{Var}_i(\varphi)$ the collection of variables of level $i$ occurring free in $\varphi$.

A (level 0) substitution $\sigma := \{x_1/y_1, \ldots, x_n/y_n\}$ is the mapping $\varphi \mapsto \varphi\sigma$ such that, for any given 4LQS-formula $\varphi$, $\varphi\sigma$ is the 4LQS-formula obtained from $\varphi$ by replacing the free occurrences of the variables $x_1, \ldots, x_n$ in $\varphi$ with the variables $y_1, \ldots, y_n$, respectively. We say that a substitution $\sigma$ is free for $\varphi$ if the formulae $\varphi$ and $\varphi\sigma$ have exactly the same occurrences of quantified variables.

A 4LQS-interpretation is a pair $\mathcal{M} = (D, M)$, where $D$ is a non-empty collection of objects (called domain or universe of $\mathcal{M}$) and $M$ is an assignment over the variables in $\mathcal{V}_i$, for $i = 0, 1, 2, 3$, such that:

$MX^0 \in D$, $MX^1 \in \mathcal{P}(D)$, $MX^2 \in \mathcal{P}(\mathcal{P}(D))$, $MX^3 \in \mathcal{P}(\mathcal{P}(\mathcal{P}(D)))$,

where $X^i \in \mathcal{V}_i$, for $i = 0, 1, 2, 3$, and $\mathcal{P}(s)$ denotes the powerset of $s$.

$^1$ The notion of syntax tree for 4LQS-formulae is similar to the notion of syntax tree for formulae of first-order logic. A precise definition of the latter can be found in [12].

Pair terms are interpreted à la Kuratowski, and therefore we put

$M\langle x, y\rangle := \{\{Mx\}, \{Mx, My\}\}$.
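As an illustration of the Kuratowski encoding just given, the following sketch builds pairs as nested immutable sets. The function name `kuratowski_pair` is an illustrative choice, not notation from the paper.

```python
def kuratowski_pair(x, y):
    """Encode the ordered pair <x, y> as the set {{x}, {x, y}}."""
    return frozenset({frozenset({x}), frozenset({x, y})})


# Order matters, although the encoding uses only unordered sets.
print(kuratowski_pair(1, 2) == kuratowski_pair(2, 1))
# When both components coincide, the encoding collapses to {{x}}.
print(kuratowski_pair(1, 1) == frozenset({frozenset({1})}))
```

Since a pair of elements of $D$ is a set of subsets of $D$, a binary relation (a set of such pairs) belongs to the third powerset level, which is why pair terms may be members of variables of sort 3.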


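Next, let

- $\mathcal{M} = (D, M)$ be a 4LQS-interpretation,

- $x_1, \ldots, x_n \in \mathcal{V}_0$, $X^1_1, \ldots, X^1_m \in \mathcal{V}_1$, $X^2_1, \ldots, X^2_p \in \mathcal{V}_2$, and

- $u_1, \ldots, u_n \in D$, $U^1_1, \ldots, U^1_m \in \mathcal{P}(D)$, $U^2_1, \ldots, U^2_p \in \mathcal{P}(\mathcal{P}(D))$.

By $\mathcal{M}[\vec{x}/\vec{u}, \vec{X}^1/\vec{U}^1, \vec{X}^2/\vec{U}^2]$, we denote the interpretation $\mathcal{M}' = (D, M')$ such that $M'x_i = u_i$ (for $i = 1, \ldots, n$), $M'X^1_j = U^1_j$ (for $j = 1, \ldots, m$), $M'X^2_k = U^2_k$ (for $k = 1, \ldots, p$), and which otherwise coincides with $M$ on all remaining variables. For a 4LQS-interpretation $\mathcal{M} = (D, M)$ and a 4LQS-formula $\varphi$, the satisfiability relationship $\mathcal{M} \models \varphi$ is defined inductively over the structure of $\varphi$ as follows. Quantifier-free atomic formulae are evaluated in a standard way according to the usual meaning of the predicates '$\in$' and '$=$', and purely universal formulae are evaluated as follows:

- $\mathcal{M} \models (\forall z_1) \ldots (\forall z_n)\varphi_0$ iff $\mathcal{M}[\vec{z}/\vec{u}] \models \varphi_0$, for all $\vec{u} \in D^n$;

- $\mathcal{M} \models (\forall Z^1_1) \ldots (\forall Z^1_m)\varphi_1$ iff $\mathcal{M}[\vec{Z}^1/\vec{U}^1] \models \varphi_1$, for all $\vec{U}^1 \in (\mathcal{P}(D))^m$;

- $\mathcal{M} \models (\forall Z^2_1) \ldots (\forall Z^2_p)\varphi_2$ iff $\mathcal{M}[\vec{Z}^2/\vec{U}^2] \models \varphi_2$, for all $\vec{U}^2 \in (\mathcal{P}(\mathcal{P}(D)))^p$.

Finally, compound formulae are interpreted according to the standard rules of propositional logic. If $\mathcal{M} \models \varphi$, then $\mathcal{M}$ is said to be a 4LQS-model for $\varphi$. A 4LQS-formula is said to be satisfiable if it has a 4LQS-model. A 4LQS-formula is valid if it is satisfied by all 4LQS-interpretations.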

We are now ready to present the fragment $\mathsf{4LQS^R}$ of 4LQS of our interest. This is the collection of the formulae $\psi$ of 4LQS fulfilling the restrictions:

1. for every purely universal formula $(\forall Z^1_1) \ldots (\forall Z^1_m)\varphi_1$ of level 2 occurring in $\psi$ and every purely universal formula $(\forall z_1) \ldots (\forall z_n)\varphi_0$ of level 1 occurring negatively in $\varphi_1$, $\varphi_0$ is a propositional combination of quantifier-free atomic formulae of level 0 and the condition

$$\neg\varphi_0 \rightarrow \bigwedge_{i=1}^{n} \bigwedge_{j=1}^{m} z_i \in Z^1_j$$

is a valid 4LQS-formula (in this case we say that $(\forall z_1) \ldots (\forall z_n)\varphi_0$ is linked to the variables $Z^1_1, \ldots, Z^1_m$);

2. for every purely universal formula $(\forall Z^2_1) \ldots (\forall Z^2_p)\varphi_2$ of level 3 in $\psi$:

- every purely universal formula of level 1 occurring negatively in $\varphi_2$ and not occurring in a purely universal formula of level 2 is only allowed to be of the form

$$(\forall z_1) \ldots (\forall z_n) \neg\Big(\bigwedge_{i=1}^{n} \bigwedge_{j=1}^{n} \langle z_i, z_j\rangle = Y^2_{ij}\Big),$$

with $Y^2_{ij} \in \mathcal{V}_2$, for $i, j = 1, \ldots, n$;

- purely universal formulae $(\forall Z^1_1) \ldots (\forall Z^1_m)\varphi_1$ of level 2 may occur only positively in $\varphi_2$.

Restriction 1 has been introduced for technical reasons concerning the decidability of the satisfiability problem for the fragment, while restriction 2 allows one to define binary relations and several operations on them (for space reasons details are not included here but can be found in [4]).

The semantics of $\mathsf{4LQS^R}$ plainly coincides with that of 4LQS.


2.2 The logic $\mathcal{DL}\langle \mathsf{4LQS^{R,\times}}\rangle(\mathbf{D})$

The description logic $\mathcal{DL}\langle \mathsf{4LQS^{R,\times}}\rangle(\mathbf{D})$ (more simply referred to as $\mathcal{DL}_{\mathbf{D}}^{4,\times}$) is the extension of the logic $\mathcal{DL}\langle \mathsf{4LQS^{R}}\rangle(\mathbf{D})$ (for short $\mathcal{DL}_{\mathbf{D}}^{4}$) presented in [7] in which Boolean operations on concrete roles and the product of concepts are admitted. Analogously to $\mathcal{DL}_{\mathbf{D}}^{4}$, the logic $\mathcal{DL}_{\mathbf{D}}^{4,\times}$ supports concept constructs such as full negation, union and intersection of concepts, concept domain and range, and existential quantification and min cardinality on the left-hand side of inclusion axioms; role constructs such as role chains on the left-hand side of inclusion axioms, union, intersection, and complement of roles; and properties on roles such as transitivity, symmetry, reflexivity, and irreflexivity.

As far as the construction of role inclusion axioms is concerned, $\mathcal{DL}_{\mathbf{D}}^{4,\times}$ is more liberal than $\mathcal{SROIQ}(\mathbf{D})$ (the logic underlying the most expressive Web Ontology Language 2 profile, OWL 2 DL [24]), since the roles involved are not required to be subject to any ordering relationship, and the notion of simple role is not needed. $\mathcal{DL}_{\mathbf{D}}^{4,\times}$ treats derived datatypes by admitting datatype terms constructed from data ranges by means of a finite number of applications of the Boolean operators. Basic and derived datatypes can be used inside inclusion axioms involving concrete roles.

Datatypes are defined according to [19] as follows. Let $\mathbf{D} = (N_D, N_C, N_F, \cdot^{\mathbf{D}})$ be a datatype map, where $N_D$ is a finite set of datatypes, $N_C$ is a map assigning a set of constants $N_C(d)$ to each datatype $d \in N_D$, $N_F$ is a map assigning a set of facets $N_F(d)$ to each $d \in N_D$, and $\cdot^{\mathbf{D}}$ is a map assigning

(i) a datatype interpretation $d^{\mathbf{D}}$ to each datatype $d \in N_D$,

(ii) a facet interpretation $f^{\mathbf{D}} \subseteq d^{\mathbf{D}}$ to each facet $f \in N_F(d)$, and

(iii) a data value $e_d^{\mathbf{D}} \in d^{\mathbf{D}}$ to every constant $e_d \in N_C(d)$.

We shall assume that the interpretations of the datatypes in $N_D$ are non-empty pairwise disjoint sets.

A facet expression for a datatype $d \in N_D$ is a formula $\psi_d$ constructed from the elements of $N_F(d) \cup \{\top_d, \bot_d\}$ by applying a finite number of times the connectives $\neg$, $\wedge$, and $\vee$. The function $\cdot^{\mathbf{D}}$ is extended to facet expressions for $d \in N_D$ by putting, for $f, f_1, f_2 \in N_F(d)$,

- $(\neg f)^{\mathbf{D}} = d^{\mathbf{D}} \setminus f^{\mathbf{D}}$,

- $(f_1 \wedge f_2)^{\mathbf{D}} = f_1^{\mathbf{D}} \cap f_2^{\mathbf{D}}$,

- $(f_1 \vee f_2)^{\mathbf{D}} = f_1^{\mathbf{D}} \cup f_2^{\mathbf{D}}$.
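The extension of the interpretation map to facet expressions amounts to a recursive set computation, which the following sketch illustrates over finite sets. The tuple encoding, the name `eval_facet`, and the reading of $\top_d$ and $\bot_d$ as the whole datatype and the empty set are assumptions made for the example; the text above fixes only the clauses for $\neg$, $\wedge$, and $\vee$.

```python
def eval_facet(expr, datatype_values, facets):
    """Interpret a facet expression as a subset of `datatype_values`.

    expr is ("facet", name), ("top",), ("bottom",), ("not", e),
    ("and", e1, e2) or ("or", e1, e2); `facets` maps facet names to sets.
    """
    op = expr[0]
    if op == "facet":
        return frozenset(facets[expr[1]])
    if op == "top":      # assumed reading: the whole datatype
        return frozenset(datatype_values)
    if op == "bottom":   # assumed reading: the empty set
        return frozenset()
    if op == "not":      # complement relative to the datatype
        return frozenset(datatype_values) - eval_facet(expr[1], datatype_values, facets)
    left = eval_facet(expr[1], datatype_values, facets)
    right = eval_facet(expr[2], datatype_values, facets)
    if op == "and":
        return left & right
    if op == "or":
        return left | right
    raise ValueError(f"unknown connective: {op}")


digits = range(10)
facets = {"even": {0, 2, 4, 6, 8}, "small": {0, 1, 2, 3}}
expr = ("and", ("facet", "even"), ("not", ("facet", "small")))
print(sorted(eval_facet(expr, digits, facets)))
```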



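A data range $dr$ for $\mathbf{D}$ is either a datatype $d \in N_D$, or a finite enumeration of datatype constants $\{e_{d_1}, \ldots, e_{d_n}\}$, with $e_{d_i} \in N_C(d_i)$ and $d_i \in N_D$, or a facet expression $\psi_d$, for $d \in N_D$, or their complementation.

Let $\mathbf{R_A}$, $\mathbf{R_D}$, $\mathbf{C}$, $\mathbf{Ind}$ be denumerable pairwise disjoint sets of abstract role names, concrete role names, concept names, and individual names, respectively. We assume that the set of abstract role names $\mathbf{R_A}$ contains a name $U$ denoting the universal role.

(a) $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-datatype, (b) $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-concept, (c) $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-abstract role, and (d) $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-concrete role terms are constructed according to the following syntax rules:

(a) $t_1, t_2 \longrightarrow dr \mid \neg t_1 \mid t_1 \sqcap t_2 \mid t_1 \sqcup t_2 \mid \{e_d\}$,

(b) $C_1, C_2 \longrightarrow A \mid \top \mid \bot \mid \neg C_1 \mid C_1 \sqcup C_2 \mid C_1 \sqcap C_2 \mid \{a\} \mid \exists R.\mathit{Self} \mid \exists R.\{a\} \mid \exists P.\{e_d\}$,

(c) $R_1, R_2 \longrightarrow S \mid U \mid R_1^- \mid \neg R_1 \mid R_1 \sqcup R_2 \mid R_1 \sqcap R_2 \mid R_{C_1|} \mid R_{|C_1} \mid R_{C_1|C_2} \mid id(C) \mid C_1 \times C_2$,

(d) $P_1, P_2 \longrightarrow T \mid \neg P_1 \mid P_1 \sqcup P_2 \mid P_1 \sqcap P_2 \mid P_{C_1|} \mid P_{|t_1} \mid P_{C_1|t_1}$,

where $dr$ is a data range for $\mathbf{D}$, $t_1, t_2$ are datatype terms, $e_d$ is a constant in $N_C(d)$, $a$ is an individual name, $A$ is a concept name, $C_1, C_2$ are $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-concept terms, $S$ is an abstract role name, $R, R_1, R_2$ are $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-abstract role terms, $T$ is a concrete role name, and $P, P_1, P_2$ are $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-concrete role terms.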

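A $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-knowledge base is a triple $\mathcal{KB} = (\mathcal{R}, \mathcal{T}, \mathcal{A})$ such that $\mathcal{R}$ is a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-RBox, $\mathcal{T}$ is a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-TBox, and $\mathcal{A}$ is a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-ABox (see next).

A $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-RBox is a collection of statements of the following forms:

$R_1 \equiv R_2$, $R_1 \sqsubseteq R_2$, $R_1 \ldots R_n \sqsubseteq R_{n+1}$, $\mathrm{Sym}(R_1)$, $\mathrm{Asym}(R_1)$, $\mathrm{Ref}(R_1)$, $\mathrm{Irref}(R_1)$, $\mathrm{Dis}(R_1, R_2)$, $\mathrm{Tra}(R_1)$, $\mathrm{Fun}(R_1)$, $R_1 \equiv C_1 \times C_2$, $P_1 \equiv P_2$, $P_1 \sqsubseteq P_2$, $\mathrm{Dis}(P_1, P_2)$, $\mathrm{Fun}(P_1)$,

where $R_1, R_2$ are $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-abstract role terms, $C_1, C_2$ are $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-concept terms, and $P_1, P_2$ are $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-concrete role terms. Any expression of the type $w \sqsubseteq R$, where $w$ is a finite string of $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-abstract role terms and $R$ is a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-abstract role term, is called a role inclusion axiom (RIA).

Next, a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-TBox is a set of statements of the types:

$C_1 \equiv C_2$, $C_1 \sqsubseteq C_2$, $C_1 \sqsubseteq \forall R_1.C_2$, $\exists R_1.C_1 \sqsubseteq C_2$, $\geq_n R_1.C_1 \sqsubseteq C_2$, $C_1 \sqsubseteq \leq_n R_1.C_2$,
$t_1 \equiv t_2$, $t_1 \sqsubseteq t_2$, $C_1 \sqsubseteq \forall P_1.t_1$, $\exists P_1.t_1 \sqsubseteq C_1$, $\geq_n P_1.t_1 \sqsubseteq C_1$, $C_1 \sqsubseteq \leq_n P_1.t_1$,

where $C_1, C_2$ are $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-concept terms, $t_1, t_2$ are datatype terms, $R_1$ is a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-abstract role term, and $P_1$ is a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-concrete role term. Any statement $C \sqsubseteq D$, with $C$ and $D$ $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-concept terms, is a general concept inclusion axiom (GCI).

Finally, a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-ABox is a set of individual assertions of the forms:

$a : C_1$, $(a, b) : R_1$, $a = b$, $e_d : t_1$, $(a, e_d) : P_1$,

with $a, b$ individual names, $C_1$ a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-concept term, $R_1$ a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-abstract role term, $d$ a datatype, $e_d$ a constant in $N_C(d)$, $t_1$ a datatype term, and $P_1$ a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-concrete role term.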

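The semantics of $\mathcal{DL}_{\mathbf{D}}^{4,\times}$ is based on interpretations $\mathbf{I} = (\Delta^{\mathbf{I}}, \Delta_{\mathbf{D}}, \cdot^{\mathbf{I}})$, where $\Delta^{\mathbf{I}}$ and $\Delta_{\mathbf{D}}$ are non-empty disjoint domains such that $d^{\mathbf{D}} \subseteq \Delta_{\mathbf{D}}$, for every $d \in N_D$, and $\cdot^{\mathbf{I}}$ is an interpretation function. The interpretation of concepts and roles, axioms and assertions is illustrated in Table 1.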


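| Name | Syntax | Semantics |
|---|---|---|
| concept | $A$ | $A^{\mathbf{I}} \subseteq \Delta^{\mathbf{I}}$ |
| ab. (resp., cn.) rl. | $R$ (resp., $P$) | $R^{\mathbf{I}} \subseteq \Delta^{\mathbf{I}} \times \Delta^{\mathbf{I}}$ (resp., $P^{\mathbf{I}} \subseteq \Delta^{\mathbf{I}} \times \Delta_{\mathbf{D}}$) |
| individual | $a$ | $a^{\mathbf{I}} \in \Delta^{\mathbf{I}}$ |
| nominal | $\{a\}$ | $\{a\}^{\mathbf{I}} = \{a^{\mathbf{I}}\}$ |
| dtype (resp., ng.) | $d$ (resp., $\neg d$) | $d^{\mathbf{D}} \subseteq \Delta_{\mathbf{D}}$ (resp., $\Delta_{\mathbf{D}} \setminus d^{\mathbf{D}}$) |
| negative datatype term | $\neg t_1$ | $(\neg t_1)^{\mathbf{D}} = \Delta_{\mathbf{D}} \setminus t_1^{\mathbf{D}}$ |
| datatype terms intersection | $t_1 \sqcap t_2$ | $(t_1 \sqcap t_2)^{\mathbf{D}} = t_1^{\mathbf{D}} \cap t_2^{\mathbf{D}}$ |
| datatype terms union | $t_1 \sqcup t_2$ | $(t_1 \sqcup t_2)^{\mathbf{D}} = t_1^{\mathbf{D}} \cup t_2^{\mathbf{D}}$ |
| constant in $N_C(d)$ | $e_d$ | $e_d^{\mathbf{D}} \in d^{\mathbf{D}}$ |
| data range | $\{e_{d_1}, \ldots, e_{d_n}\}$ | $\{e_{d_1}, \ldots, e_{d_n}\}^{\mathbf{D}} = \{e_{d_1}^{\mathbf{D}}\} \cup \ldots \cup \{e_{d_n}^{\mathbf{D}}\}$ |
| data range | $\psi_d$ | $\psi_d^{\mathbf{D}}$ |
| data range | $\neg dr$ | $\Delta_{\mathbf{D}} \setminus dr^{\mathbf{D}}$ |
| top (resp., bot.) | $\top$ (resp., $\bot$) | $\Delta^{\mathbf{I}}$ (resp., $\emptyset$) |
| negation | $\neg C$ | $(\neg C)^{\mathbf{I}} = \Delta^{\mathbf{I}} \setminus C^{\mathbf{I}}$ |
| conj. (resp., disj.) | $C \sqcap D$ (resp., $C \sqcup D$) | $(C \sqcap D)^{\mathbf{I}} = C^{\mathbf{I}} \cap D^{\mathbf{I}}$ (resp., $(C \sqcup D)^{\mathbf{I}} = C^{\mathbf{I}} \cup D^{\mathbf{I}}$) |
| valued exist. quantification | $\exists R.a$ | $(\exists R.a)^{\mathbf{I}} = \{x \in \Delta^{\mathbf{I}} : \langle x, a^{\mathbf{I}}\rangle \in R^{\mathbf{I}}\}$ |
| datatyped exist. quantif. | $\exists P.e_d$ | $(\exists P.e_d)^{\mathbf{I}} = \{x \in \Delta^{\mathbf{I}} : \langle x, e_d^{\mathbf{D}}\rangle \in P^{\mathbf{I}}\}$ |
| self concept | $\exists R.\mathit{Self}$ | $(\exists R.\mathit{Self})^{\mathbf{I}} = \{x \in \Delta^{\mathbf{I}} : \langle x, x\rangle \in R^{\mathbf{I}}\}$ |
| nominals | $\{a_1, \ldots, a_n\}$ | $\{a_1, \ldots, a_n\}^{\mathbf{I}} = \{a_1^{\mathbf{I}}\} \cup \ldots \cup \{a_n^{\mathbf{I}}\}$ |
| universal role | $U$ | $(U)^{\mathbf{I}} = \Delta^{\mathbf{I}} \times \Delta^{\mathbf{I}}$ |
| inverse role | $R^-$ | $(R^-)^{\mathbf{I}} = \{\langle y, x\rangle : \langle x, y\rangle \in R^{\mathbf{I}}\}$ |
| concept cart. prod. | $C_1 \times C_2$ | $(C_1 \times C_2)^{\mathbf{I}} = C_1^{\mathbf{I}} \times C_2^{\mathbf{I}}$ |
| abstract role complement | $\neg R$ | $(\neg R)^{\mathbf{I}} = (\Delta^{\mathbf{I}} \times \Delta^{\mathbf{I}}) \setminus R^{\mathbf{I}}$ |
| abstract role union | $R_1 \sqcup R_2$ | $(R_1 \sqcup R_2)^{\mathbf{I}} = R_1^{\mathbf{I}} \cup R_2^{\mathbf{I}}$ |
| abstract role intersection | $R_1 \sqcap R_2$ | $(R_1 \sqcap R_2)^{\mathbf{I}} = R_1^{\mathbf{I}} \cap R_2^{\mathbf{I}}$ |
| abstract role domain restr. | $R_{C\mid}$ | $(R_{C\mid})^{\mathbf{I}} = \{\langle x, y\rangle \in R^{\mathbf{I}} : x \in C^{\mathbf{I}}\}$ |
| concrete role complement | $\neg P$ | $(\neg P)^{\mathbf{I}} = (\Delta^{\mathbf{I}} \times \Delta_{\mathbf{D}}) \setminus P^{\mathbf{I}}$ |
| concrete role union | $P_1 \sqcup P_2$ | $(P_1 \sqcup P_2)^{\mathbf{I}} = P_1^{\mathbf{I}} \cup P_2^{\mathbf{I}}$ |
| concrete role intersection | $P_1 \sqcap P_2$ | $(P_1 \sqcap P_2)^{\mathbf{I}} = P_1^{\mathbf{I}} \cap P_2^{\mathbf{I}}$ |
| concrete role domain restr. | $P_{C\mid}$ | $(P_{C\mid})^{\mathbf{I}} = \{\langle x, y\rangle \in P^{\mathbf{I}} : x \in C^{\mathbf{I}}\}$ |
| concrete role range restr. | $P_{\mid t}$ | $(P_{\mid t})^{\mathbf{I}} = \{\langle x, y\rangle \in P^{\mathbf{I}} : y \in t^{\mathbf{D}}\}$ |
| concrete role restriction | $P_{C_1\mid t}$ | $(P_{C_1\mid t})^{\mathbf{I}} = \{\langle x, y\rangle \in P^{\mathbf{I}} : x \in C_1^{\mathbf{I}} \wedge y \in t^{\mathbf{D}}\}$ |
| concept subsum. | $C_1 \sqsubseteq C_2$ | $\mathbf{I} \models_{\mathbf{D}} C_1 \sqsubseteq C_2 \iff C_1^{\mathbf{I}} \subseteq C_2^{\mathbf{I}}$ |
| ab. role subsum. | $R_1 \sqsubseteq R_2$ | $\mathbf{I} \models_{\mathbf{D}} R_1 \sqsubseteq R_2 \iff R_1^{\mathbf{I}} \subseteq R_2^{\mathbf{I}}$ |
| role incl. axiom | $R_1 \ldots R_n \sqsubseteq R$ | $\mathbf{I} \models_{\mathbf{D}} R_1 \ldots R_n \sqsubseteq R \iff R_1^{\mathbf{I}} \circ \ldots \circ R_n^{\mathbf{I}} \subseteq R^{\mathbf{I}}$ |
| cn. role subsum. | $P_1 \sqsubseteq P_2$ | $\mathbf{I} \models_{\mathbf{D}} P_1 \sqsubseteq P_2 \iff P_1^{\mathbf{I}} \subseteq P_2^{\mathbf{I}}$ |
| symmetric role | $\mathrm{Sym}(R)$ | $\mathbf{I} \models_{\mathbf{D}} \mathrm{Sym}(R) \iff (R^-)^{\mathbf{I}} \subseteq R^{\mathbf{I}}$ |
| asymmetric role | $\mathrm{Asym}(R)$ | $\mathbf{I} \models_{\mathbf{D}} \mathrm{Asym}(R) \iff R^{\mathbf{I}} \cap (R^-)^{\mathbf{I}} = \emptyset$ |
| transitive role | $\mathrm{Tra}(R)$ | $\mathbf{I} \models_{\mathbf{D}} \mathrm{Tra}(R) \iff R^{\mathbf{I}} \circ R^{\mathbf{I}} \subseteq R^{\mathbf{I}}$ |
| disj. ab. role | $\mathrm{Dis}(R_1, R_2)$ | $\mathbf{I} \models_{\mathbf{D}} \mathrm{Dis}(R_1, R_2) \iff R_1^{\mathbf{I}} \cap R_2^{\mathbf{I}} = \emptyset$ |
| reflexive role | $\mathrm{Ref}(R)$ | $\mathbf{I} \models_{\mathbf{D}} \mathrm{Ref}(R) \iff \{\langle x, x\rangle : x \in \Delta^{\mathbf{I}}\} \subseteq R^{\mathbf{I}}$ |
| irreflexive role | $\mathrm{Irref}(R)$ | $\mathbf{I} \models_{\mathbf{D}} \mathrm{Irref}(R) \iff R^{\mathbf{I}} \cap \{\langle x, x\rangle : x \in \Delta^{\mathbf{I}}\} = \emptyset$ |
| func. ab. role | $\mathrm{Fun}(R)$ | $\mathbf{I} \models_{\mathbf{D}} \mathrm{Fun}(R) \iff (R^-)^{\mathbf{I}} \circ R^{\mathbf{I}} \subseteq \{\langle x, x\rangle : x \in \Delta^{\mathbf{I}}\}$ |
| disj. cn. role | $\mathrm{Dis}(P_1, P_2)$ | $\mathbf{I} \models_{\mathbf{D}} \mathrm{Dis}(P_1, P_2) \iff P_1^{\mathbf{I}} \cap P_2^{\mathbf{I}} = \emptyset$ |
| func. cn. role | $\mathrm{Fun}(P)$ | $\mathbf{I} \models_{\mathbf{D}} \mathrm{Fun}(P) \iff \langle x, y\rangle \in P^{\mathbf{I}}$ and $\langle x, z\rangle \in P^{\mathbf{I}}$ imply $y = z$ |
| datatype terms equivalence | $t_1 \equiv t_2$ | $\mathbf{I} \models_{\mathbf{D}} t_1 \equiv t_2 \iff t_1^{\mathbf{D}} = t_2^{\mathbf{D}}$ |
| datatype terms diseq. | $t_1 \not\equiv t_2$ | $\mathbf{I} \models_{\mathbf{D}} t_1 \not\equiv t_2 \iff t_1^{\mathbf{D}} \neq t_2^{\mathbf{D}}$ |
| datatype terms subsum. | $t_1 \sqsubseteq t_2$ | $\mathbf{I} \models_{\mathbf{D}} t_1 \sqsubseteq t_2 \iff t_1^{\mathbf{D}} \subseteq t_2^{\mathbf{D}}$ |
| concept assertion | $a : C_1$ | $\mathbf{I} \models_{\mathbf{D}} a : C_1 \iff a^{\mathbf{I}} \in C_1^{\mathbf{I}}$ |
| agreement | $a = b$ | $\mathbf{I} \models_{\mathbf{D}} a = b \iff a^{\mathbf{I}} = b^{\mathbf{I}}$ |
| disagreement | $a \neq b$ | $\mathbf{I} \models_{\mathbf{D}} a \neq b \iff \neg(a^{\mathbf{I}} = b^{\mathbf{I}})$ |
| ab. role asser. | $(a, b) : R$ | $\mathbf{I} \models_{\mathbf{D}} (a, b) : R \iff \langle a^{\mathbf{I}}, b^{\mathbf{I}}\rangle \in R^{\mathbf{I}}$ |
| cn. role asser. | $(a, e_d) : P$ | $\mathbf{I} \models_{\mathbf{D}} (a, e_d) : P \iff \langle a^{\mathbf{I}}, e_d^{\mathbf{D}}\rangle \in P^{\mathbf{I}}$ |

Table 1: Semantics of $\mathcal{DL}_{\mathbf{D}}^{4,\times}$.

Legenda. ab.: abstract, cn.: concrete, rl.: role, ind.: individual, d. cs.: datatype constant, dtype: datatype, ng.: negated, bot.: bottom, incl.: inclusion, asser.: assertion.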


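Let $\mathcal{KB} = (\mathcal{R}, \mathcal{T}, \mathcal{A})$ be a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-knowledge base. An interpretation $\mathbf{I} = (\Delta^{\mathbf{I}}, \Delta_{\mathbf{D}}, \cdot^{\mathbf{I}})$ is a $\mathbf{D}$-model of $\mathcal{R}$ (and we write $\mathbf{I} \models_{\mathbf{D}} \mathcal{R}$) if $\mathbf{I}$ satisfies each axiom in $\mathcal{R}$ according to the semantic rules in [5, Table 1]. Similar definitions hold for $\mathcal{T}$ and $\mathcal{A}$ too. Then $\mathbf{I}$ satisfies $\mathcal{KB}$ (and we write $\mathbf{I} \models_{\mathbf{D}} \mathcal{KB}$) if it is a $\mathbf{D}$-model of $\mathcal{R}$, $\mathcal{T}$, and $\mathcal{A}$. A knowledge base is consistent if it is satisfied by some interpretation.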



3 Conjunctive Query Answering for $\mathcal{DL}_{\mathbf{D}}^{4,\times}$


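Let $V = \{v_1, v_2, \ldots\}$ be a denumerably infinite set of variables disjoint from $\mathbf{Ind}$ and from $\bigcup\{N_C(d) : d \in N_D\}$. A $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-atomic formula is an expression of one of the following types

$R(w_1, w_2)$, $P(w_1, u_1)$, $C(w_1)$, $w_1 = w_2$, $u_1 = u_2$,

where $w_1, w_2 \in V \cup \mathbf{Ind}$, $u_1, u_2 \in V \cup \bigcup\{N_C(d) : d \in N_D\}$, $R$ is a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-abstract role term, $P$ is a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-concrete role term, and $C$ is a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-concept term. A $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-atomic formula containing no variables is said to be closed. A $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-literal is a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-atomic formula or its negation. A $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-conjunctive query is a conjunction of $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-literals. Let $v_1, \ldots, v_n \in V$ and $o_1, \ldots, o_n \in \mathbf{Ind} \cup \bigcup\{N_C(d) : d \in N_D\}$. A substitution $\sigma := \{v_1/o_1, \ldots, v_n/o_n\}$ is a map such that, for every $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-literal $L$, $L\sigma$ is obtained from $L$ by replacing the occurrences of $v_1, \ldots, v_n$ in $L$ with $o_1, \ldots, o_n$, respectively. Substitutions can be extended to $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-conjunctive queries in the usual way. Let $Q := (L_1 \wedge \ldots \wedge L_m)$ be a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-conjunctive query, and $\mathcal{KB}$ a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-knowledge base. A substitution $\sigma$ involving exactly the variables occurring in $Q$ is a solution for $Q$ w.r.t. $\mathcal{KB}$ if there exists a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-interpretation $\mathbf{I}$ such that $\mathbf{I} \models_{\mathbf{D}} \mathcal{KB}$ and $\mathbf{I} \models_{\mathbf{D}} Q\sigma$. The collection $\Sigma$ of the solutions for $Q$ w.r.t. $\mathcal{KB}$ is the answer set of $Q$ w.r.t. $\mathcal{KB}$. Then the conjunctive query answering (CQA) problem for $Q$ w.r.t. $\mathcal{KB}$ consists in finding the answer set $\Sigma$ of $Q$ w.r.t. $\mathcal{KB}$.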

We shall solve the CQA problem just stated by reducing it to the analogous problem formulated in the context of the fragment $\mathsf{4LQS^R}$ (and in turn to the decision procedure for $\mathsf{4LQS^R}$ presented in [4]). The CQA problem for $\mathsf{4LQS^R}$-formulae can be stated as follows. Let $\phi$ be a $\mathsf{4LQS^R}$-formula and let $\psi$ be a conjunction of $\mathsf{4LQS^R}$-quantifier-free atomic formulae of level 0 of the types

$x = y$, $x \in X^1$, $\langle x, y\rangle \in X^3$

or their negations, such that $\mathrm{Var}_0(\psi) \cap \mathrm{Var}_0(\phi) = \emptyset$ and $\mathrm{Var}_1(\psi) \cup \mathrm{Var}_3(\psi) \subseteq \mathrm{Var}_1(\phi) \cup \mathrm{Var}_3(\phi)$. The CQA problem for $\psi$ w.r.t. $\phi$ consists in computing the answer set of $\psi$ w.r.t. $\phi$, namely the collection $\Sigma'$ of all the substitutions $\sigma' := \{x_1/y_1, \ldots, x_n/y_n\}$ (where $x_1, \ldots, x_n$ are the distinct variables of level 0 in $\psi$ and $\{y_1, \ldots, y_n\} \subseteq \mathrm{Var}_0(\phi)$) such that $\mathcal{M} \models \phi \wedge \psi\sigma'$, for some $\mathsf{4LQS^R}$-interpretation $\mathcal{M}$. In view of the decidability of the satisfiability problem for $\mathsf{4LQS^R}$-formulae, the CQA problem for $\mathsf{4LQS^R}$-formulae can be solved effectively. Indeed, given two $\mathsf{4LQS^R}$-formulae $\phi$ and $\psi$ satisfying the above requirements, to compute the answer set of $\psi$ w.r.t. $\phi$, for each candidate substitution $\sigma' := \{x_1/y_1, \ldots, x_n/y_n\}$ (with $\{x_1, \ldots, x_n\} = \mathrm{Var}_0(\psi)$ and $\{y_1, \ldots, y_n\} \subseteq \mathrm{Var}_0(\phi)$) one just has to test the $\mathsf{4LQS^R}$-formula $\phi \wedge \psi\sigma'$ for satisfiability. Since the number of possible candidate substitutions is $|\mathrm{Var}_0(\phi)|^{|\mathrm{Var}_0(\psi)|}$ and the satisfiability test for $\mathsf{4LQS^R}$-formulae can be carried out in an effective manner, the answer set of $\psi$ w.r.t. $\phi$ can be computed effectively. Summarizing,



Lemma 1. The CQA problem for $\mathsf{4LQS^R}$-formulae can be solved in an effective manner. □
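The argument behind Lemma 1 is a plain enumeration, which the following sketch makes explicit. The satisfiability test is abstracted as a caller-supplied function `is_satisfiable`, a hypothetical stand-in for the decision procedure of [4]; the name `answer_set` and the dictionary encoding of substitutions are likewise illustrative assumptions.

```python
from itertools import product


def answer_set(query_vars, kb_vars, is_satisfiable):
    """Collect the substitutions that pass the satisfiability oracle.

    query_vars: the level 0 variables of psi (pairwise distinct).
    kb_vars: the level 0 variables free in phi.
    is_satisfiable(sigma): stands for testing phi AND psi*sigma.
    """
    answers = []
    # One candidate per function from query_vars to kb_vars,
    # i.e. len(kb_vars) ** len(query_vars) candidates overall.
    for image in product(kb_vars, repeat=len(query_vars)):
        sigma = dict(zip(query_vars, image))
        if is_satisfiable(sigma):
            answers.append(sigma)
    return answers


# Toy oracle (not a real decision procedure): accept sigma when the
# two query variables are sent to different variables.
toy = answer_set(["x", "y"], ["a", "b", "c"], lambda s: s["x"] != s["y"])
print(len(toy))
```

The loop runs once per candidate substitution, so the cost is the number of candidates multiplied by the cost of a single satisfiability test.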

The following theorem states that the CQA problem for $\mathcal{DL}_{\mathbf{D}}^{4,\times}$ can also be solved effectively.

Theorem 1. Given a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-knowledge base $\mathcal{KB}$ and a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-conjunctive query $Q$, the CQA problem for $Q$ w.r.t. $\mathcal{KB}$ can be solved in an effective manner.

We first outline the main ideas and then we provide a formal proof of the theorem.

As remarked above, the CQA problem for $\mathcal{DL}_{\mathbf{D}}^{4,\times}$ can be solved via an effective reduction to the CQA problem for $\mathsf{4LQS^R}$-formulae, and then exploiting Lemma 1. The reduction is accomplished through a function $\theta$ that effectively maps variables in $V$, individuals in $\mathbf{Ind}$, and datatype constants in $\bigcup\{N_C(d) : d \in N_D\}$ into variables of sort 0 (of the $\mathsf{4LQS^R}$-language), etc., and $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-TBoxes, -RBoxes, and -ABoxes, and $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-conjunctive queries into $\mathsf{4LQS^R}$-formulae in conjunctive normal form (CNF). The function $\theta$ can thus be used to map effectively CQA problems from the $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-context into the $\mathsf{4LQS^R}$-context. More specifically, given a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-knowledge base $\mathcal{KB}$ and a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-conjunctive query $Q$, using the function $\theta$ we can effectively construct the following $\mathsf{4LQS^R}$-formulae in CNF:

$\phi_{\mathcal{KB}} := \bigwedge_{H \in \mathcal{KB}} \theta(H) \wedge \bigwedge_{i=1}^{12} \xi_i$, $\psi_Q := \theta(Q)$.$^2$

Then, if we denote by $\Sigma$ the answer set of $Q$ w.r.t. $\mathcal{KB}$ and by $\Sigma'$ the answer set of $\psi_Q$ w.r.t. $\phi_{\mathcal{KB}}$, we have that $\Sigma$ consists of all substitutions $\sigma$ (involving exactly the variables occurring in $Q$) such that $\theta(\sigma) \in \Sigma'$. Since, by Lemma 1, $\Sigma'$ can be computed effectively, $\Sigma$ can be computed effectively too.

We are now ready to provide the proof of Theorem 1. 

Proof. As a preliminary step, observe that the statements of $\mathcal{KB}$ that need to be considered are the following:

- $C_1 \equiv \top$, $C_1 \equiv \neg C_2$, $C_1 \equiv C_2 \sqcup C_3$, $C_1 \equiv \{a\}$, $C_1 \sqsubseteq \forall R_1.C_2$, $\exists R_1.C_1 \sqsubseteq C_2$, $\geq_n R_1.C_1 \sqsubseteq C_2$, $C_1 \sqsubseteq \leq_n R_1.C_2$, $C_1 \sqsubseteq \forall P_1.t_1$, $\exists P_1.t_1 \sqsubseteq C_1$, $\geq_n P_1.t_1 \sqsubseteq C_1$, $C_1 \sqsubseteq \leq_n P_1.t_1$,

- $R_1 \equiv U$, $R_1 \equiv \neg R_2$, $R_1 \equiv R_2 \sqcup R_3$, $R_1 \equiv R_2^-$, $R_1 \equiv id(C_1)$, $R_1 \equiv R_{2\,C_1|}$, $R_1 \ldots R_n \sqsubseteq R_{n+1}$, $\mathrm{Ref}(R_1)$, $\mathrm{Irref}(R_1)$, $\mathrm{Dis}(R_1, R_2)$, $\mathrm{Fun}(R_1)$, $R_1 \equiv C_1 \times C_2$,

- $P_1 \equiv P_2$, $P_1 \equiv \neg P_2$, $P_1 \equiv P_2 \sqcup P_3$, $P_1 \sqsubseteq P_2$, $\mathrm{Fun}(P_1)$, $P_1 \equiv P_{2\,C_1|}$, $P_1 \equiv P_{2\,C_1|t_1}$, $P_1 \equiv P_{2\,|t_1}$,

- $a : C_1$, $(a, b) : R_1$, $(a, b) : \neg R_1$, $a = b$, $a \neq b$,

- $e_d : t_1$, $(a, e_d) : P_1$, $(a, e_d) : \neg P_1$.

$^2$ The definition of the function $\theta$ is inspired by that of the function $\tau$ introduced in the proof of Theorem 1 in [7]. Specifically, $\theta$ differs from $\tau$ as (i) it allows quantification only on variables of level 0, (ii) it treats Boolean operations on concrete roles and the product of concepts, and (iii) it constructs $\mathsf{4LQS^R}$-formulae in CNF. In addition, the constraints $\xi_1$-$\xi_{12}$ are similar to the constraints introduced in the proof of Theorem 1 in [7]; they are introduced to guarantee that each model of $\phi_{\mathcal{KB}}$ can be transformed into a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-interpretation.


We solve the problem of CQA for $\mathcal{DL}_{\mathbf{D}}^{4,\times}$ via a reduction to the problem of CQA for $\mathsf{4LQS^R}$, exploiting the decidability result proved in Lemma 1.

We define a function $\theta$ that maps the $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-knowledge base $\mathcal{KB}$ and the $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-conjunctive query $Q$ into the $\mathsf{4LQS^R}$-formulae in Conjunctive Normal Form (CNF) $\phi_{\mathcal{KB}}$ and $\psi_Q$, respectively, and the answer set $\Sigma$ for $Q$ w.r.t. $\mathcal{KB}$ into a set $\Sigma'$ of (level 0) substitutions in the $\mathsf{4LQS^R}$ formalism.

We will show that $\Sigma$ is the answer set for $Q$ w.r.t. $\mathcal{KB}$ iff $\Sigma$ is equal to $\Sigma' = \bigcup_{\mathcal{M} \models \phi_{\mathcal{KB}}} \Sigma'_{\mathcal{M}}$, where $\Sigma'_{\mathcal{M}}$ is the collection of substitutions $\sigma'$ such that $\mathcal{M} \models \psi_Q\sigma'$.

The definition of the mapping $\theta$ is inspired by the definition of the mapping $\tau$ introduced in the proof of Theorem 1 in [7]. Specifically, $\theta$ differs from $\tau$ because it allows quantification only on variables of level 0, it treats Boolean operations on concrete roles and the product of concepts, and it constructs $\mathsf{4LQS^R}$-formulae in CNF. To prepare for the definition of $\theta$, we map injectively individuals $a$, constants $e_d \in N_C(d)$, and variables $y, z, \ldots \in V$ into level 0 variables $x_a$, $x_{e_d}$, $x_y$, $x_z$; the constant concepts $\top$ and $\bot$, datatype terms $t$, and concept terms $C$ into level 1 variables $X^1_{\top}$, $X^1_{\bot}$, $X^1_t$, $X^1_C$, respectively; and the universal relation on individuals $U$, abstract role terms $R$, and concrete role terms $P$ into level 3 variables $X^3_U$, $X^3_R$, and $X^3_P$, respectively.$^3$

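Then the mapping $\theta$ is defined as follows:

$\theta(C_1 \equiv \top) := (\forall z)((\neg(z \in X^1_{C_1}) \vee z \in X^1_{\top}) \wedge (\neg(z \in X^1_{\top}) \vee z \in X^1_{C_1}))$,

$\theta(C_1 \equiv \neg C_2) := (\forall z)((\neg(z \in X^1_{C_1}) \vee \neg(z \in X^1_{C_2})) \wedge (z \in X^1_{C_2} \vee z \in X^1_{C_1}))$,

$\theta(C_1 \equiv C_2 \sqcup C_3) := (\forall z)((\neg(z \in X^1_{C_1}) \vee (z \in X^1_{C_2} \vee z \in X^1_{C_3})) \wedge ((\neg(z \in X^1_{C_2}) \vee z \in X^1_{C_1}) \wedge (\neg(z \in X^1_{C_3}) \vee z \in X^1_{C_1})))$,

$\theta(C_1 \equiv \{a\}) := (\forall z)((\neg(z \in X^1_{C_1}) \vee z = x_a) \wedge (\neg(z = x_a) \vee z \in X^1_{C_1}))$,

$\theta(C_1 \sqsubseteq \forall R_1.C_2) := (\forall z_1)(\forall z_2)(\neg(z_1 \in X^1_{C_1}) \vee (\neg(\langle z_1, z_2\rangle \in X^3_{R_1}) \vee z_2 \in X^1_{C_2}))$,

$\theta(\exists R_1.C_1 \sqsubseteq C_2) := (\forall z_1)(\forall z_2)((\neg(\langle z_1, z_2\rangle \in X^3_{R_1}) \vee \neg(z_2 \in X^1_{C_1})) \vee z_1 \in X^1_{C_2})$,

$\theta(C_1 \equiv \exists R_1.\{a\}) := (\forall z)((\neg(z \in X^1_{C_1}) \vee \langle z, x_a\rangle \in X^3_{R_1}) \wedge (\neg(\langle z, x_a\rangle \in X^3_{R_1}) \vee z \in X^1_{C_1}))$,

$\theta(C_1 \sqsubseteq \leq_n R_1.C_2) := (\forall z)(\forall z_1) \ldots (\forall z_{n+1})(\neg(z \in X^1_{C_1}) \vee (\bigvee_{i=1}^{n+1}(\neg(z_i \in X^1_{C_2}) \vee \neg(\langle z, z_i\rangle \in X^3_{R_1})) \vee \bigvee_{i<j} z_i = z_j))$,

$\theta(\geq_n R_1.C_1 \sqsubseteq C_2) := (\forall z)(\forall z_1) \ldots (\forall z_n)((\bigvee_{i=1}^{n}(\neg(z_i \in X^1_{C_1}) \vee \neg(\langle z, z_i\rangle \in X^3_{R_1})) \vee \bigvee_{i<j} z_i = z_j) \vee z \in X^1_{C_2})$,

$\theta(C_1 \sqsubseteq \forall P_1.t_1) := (\forall z_1)(\forall z_2)(\neg(z_1 \in X^1_{C_1}) \vee (\neg(\langle z_1, z_2\rangle \in X^3_{P_1}) \vee z_2 \in X^1_{t_1}))$,

$\theta(\exists P_1.t_1 \sqsubseteq C_1) := (\forall z_1)(\forall z_2)((\neg(\langle z_1, z_2\rangle \in X^3_{P_1}) \vee \neg(z_2 \in X^1_{t_1})) \vee z_1 \in X^1_{C_1})$,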


$^3$ The use of level 3 variables to model abstract and concrete role terms is motivated by the fact that their elements, that is ordered pairs $\langle x, y\rangle$, are encoded in Kuratowski's style as $\{\{x\}, \{x, y\}\}$, namely as collections of sets of objects.



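$\theta(C_1 \equiv \exists P_1.\{e_d\}) := (\forall z)((\neg(z \in X^1_{C_1}) \vee \langle z, x_{e_d}\rangle \in X^3_{P_1}) \wedge (\neg(\langle z, x_{e_d}\rangle \in X^3_{P_1}) \vee z \in X^1_{C_1}))$,

$\theta(C_1 \sqsubseteq \leq_n P_1.t_1) := (\forall z)(\forall z_1) \ldots (\forall z_{n+1})(\neg(z \in X^1_{C_1}) \vee (\bigvee_{i=1}^{n+1}(\neg(z_i \in X^1_{t_1}) \vee \neg(\langle z, z_i\rangle \in X^3_{P_1})) \vee \bigvee_{i<j} z_i = z_j))$,

$\theta(\geq_n P_1.t_1 \sqsubseteq C_1) := (\forall z)(\forall z_1) \ldots (\forall z_n)((\bigvee_{i=1}^{n}(\neg(z_i \in X^1_{t_1}) \vee \neg(\langle z, z_i\rangle \in X^3_{P_1})) \vee \bigvee_{i<j} z_i = z_j) \vee z \in X^1_{C_1})$,

$\theta(R_1 \equiv U) := (\forall z_1)(\forall z_2)((\neg(\langle z_1, z_2\rangle \in X^3_{R_1}) \vee \langle z_1, z_2\rangle \in X^3_{U}) \wedge (\neg(\langle z_1, z_2\rangle \in X^3_{U}) \vee \langle z_1, z_2\rangle \in X^3_{R_1}))$,

$\theta(R_1 \equiv \neg R_2) := (\forall z_1)(\forall z_2)((\neg(\langle z_1, z_2\rangle \in X^3_{R_1}) \vee \neg(\langle z_1, z_2\rangle \in X^3_{R_2})) \wedge (\langle z_1, z_2\rangle \in X^3_{R_2} \vee \langle z_1, z_2\rangle \in X^3_{R_1}))$,

$\theta(R \equiv C_1 \times C_2) := (\forall z_1)(\forall z_2)((\neg(\langle z_1, z_2\rangle \in X^3_{R}) \vee z_1 \in X^1_{C_1}) \wedge (\neg(\langle z_1, z_2\rangle \in X^3_{R}) \vee z_2 \in X^1_{C_2}) \wedge ((\neg(z_1 \in X^1_{C_1}) \vee \neg(z_2 \in X^1_{C_2})) \vee \langle z_1, z_2\rangle \in X^3_{R}))$,

$\theta(R_1 \equiv R_2 \sqcup R_3) := (\forall z_1)(\forall z_2)((\neg(\langle z_1, z_2\rangle \in X^3_{R_1}) \vee (\langle z_1, z_2\rangle \in X^3_{R_2} \vee \langle z_1, z_2\rangle \in X^3_{R_3})) \wedge ((\neg(\langle z_1, z_2\rangle \in X^3_{R_2}) \vee \langle z_1, z_2\rangle \in X^3_{R_1}) \wedge (\neg(\langle z_1, z_2\rangle \in X^3_{R_3}) \vee \langle z_1, z_2\rangle \in X^3_{R_1})))$,

$\theta(R_1 \equiv R_2^-) := (\forall z_1)(\forall z_2)((\neg(\langle z_1, z_2\rangle \in X^3_{R_1}) \vee \langle z_2, z_1\rangle \in X^3_{R_2}) \wedge (\neg(\langle z_2, z_1\rangle \in X^3_{R_2}) \vee \langle z_1, z_2\rangle \in X^3_{R_1}))$,

$\theta(R_1 \equiv id(C_1)) := (\forall z_1)(\forall z_2)(((\neg(\langle z_1, z_2\rangle \in X^3_{R_1}) \vee z_1 \in X^1_{C_1}) \wedge (\neg(\langle z_1, z_2\rangle \in X^3_{R_1}) \vee z_2 \in X^1_{C_1}) \wedge (\neg(\langle z_1, z_2\rangle \in X^3_{R_1}) \vee z_1 = z_2)) \wedge ((\neg(z_1 \in X^1_{C_1}) \vee \neg(z_2 \in X^1_{C_1}) \vee z_1 \neq z_2) \vee \langle z_1, z_2\rangle \in X^3_{R_1}))$,

$\theta(R_1 \equiv R_{2\,C_1|}) := (\forall z_1)(\forall z_2)(((\neg(\langle z_1, z_2\rangle \in X^3_{R_1}) \vee \langle z_1, z_2\rangle \in X^3_{R_2}) \wedge (\neg(\langle z_1, z_2\rangle \in X^3_{R_1}) \vee z_1 \in X^1_{C_1})) \wedge ((\neg(\langle z_1, z_2\rangle \in X^3_{R_2}) \vee \neg(z_1 \in X^1_{C_1})) \vee \langle z_1, z_2\rangle \in X^3_{R_1}))$,

$\theta(R_1 \ldots R_n \sqsubseteq R_{n+1}) := (\forall z)(\forall z_1) \ldots (\forall z_n)((\neg(\langle z, z_1\rangle \in X^3_{R_1}) \vee \ldots \vee \neg(\langle z_{n-1}, z_n\rangle \in X^3_{R_n})) \vee \langle z, z_n\rangle \in X^3_{R_{n+1}})$,

$\theta(\mathrm{Ref}(R_1)) := (\forall z)(\langle z, z\rangle \in X^3_{R_1})$,

$\theta(\mathrm{Irref}(R_1)) := (\forall z)(\neg(\langle z, z\rangle \in X^3_{R_1}))$,

$\theta(\mathrm{Fun}(R_1)) := (\forall z_1)(\forall z_2)(\forall z_3)((\neg(\langle z_1, z_2\rangle \in X^3_{R_1}) \vee \neg(\langle z_1, z_3\rangle \in X^3_{R_1})) \vee z_2 = z_3)$,

$\theta(P_1 \equiv P_2) := (\forall z_1)(\forall z_2)((\neg(\langle z_1, z_2\rangle \in X^3_{P_1}) \vee \langle z_1, z_2\rangle \in X^3_{P_2}) \wedge (\neg(\langle z_1, z_2\rangle \in X^3_{P_2}) \vee \langle z_1, z_2\rangle \in X^3_{P_1}))$,

$\theta(P_1 \equiv \neg P_2) := (\forall z_1)(\forall z_2)((\neg(\langle z_1, z_2\rangle \in X^3_{P_1}) \vee \neg(\langle z_1, z_2\rangle \in X^3_{P_2})) \wedge (\langle z_1, z_2\rangle \in X^3_{P_2} \vee \langle z_1, z_2\rangle \in X^3_{P_1}))$,

$\theta(P_1 \sqsubseteq P_2) := (\forall z_1)(\forall z_2)(\neg(\langle z_1, z_2\rangle \in X^3_{P_1}) \vee \langle z_1, z_2\rangle \in X^3_{P_2})$,

$\theta(\mathrm{Fun}(P_1)) := (\forall z_1)(\forall z_2)(\forall z_3)((\neg(\langle z_1, z_2\rangle \in X^3_{P_1}) \vee \neg(\langle z_1, z_3\rangle \in X^3_{P_1})) \vee z_2 = z_3)$,

$\theta(P_1 \equiv P_{2\,C_1|}) := (\forall z_1)(\forall z_2)((\neg(\langle z_1, z_2\rangle \in X^3_{P_1}) \vee \langle z_1, z_2\rangle \in X^3_{P_2}) \wedge (\neg(\langle z_1, z_2\rangle \in X^3_{P_1}) \vee z_1 \in X^1_{C_1}) \wedge ((\neg(\langle z_1, z_2\rangle \in X^3_{P_2}) \vee \neg(z_1 \in X^1_{C_1})) \vee \langle z_1, z_2\rangle \in X^3_{P_1}))$,

$\theta(P_1 \equiv P_{2\,|t_1}) := (\forall z_1)(\forall z_2)((\neg(\langle z_1, z_2\rangle \in X^3_{P_1}) \vee \langle z_1, z_2\rangle \in X^3_{P_2}) \wedge (\neg(\langle z_1, z_2\rangle \in X^3_{P_1}) \vee z_2 \in X^1_{t_1}) \wedge ((\neg(\langle z_1, z_2\rangle \in X^3_{P_2}) \vee \neg(z_2 \in X^1_{t_1})) \vee \langle z_1, z_2\rangle \in X^3_{P_1}))$,

$\theta(P_1 \equiv P_{2\,C_1|t_1}) := (\forall z_1)(\forall z_2)((\neg(\langle z_1, z_2\rangle \in X^3_{P_1}) \vee \langle z_1, z_2\rangle \in X^3_{P_2}) \wedge (\neg(\langle z_1, z_2\rangle \in X^3_{P_1}) \vee z_1 \in X^1_{C_1}) \wedge (\neg(\langle z_1, z_2\rangle \in X^3_{P_1}) \vee z_2 \in X^1_{t_1}) \wedge ((\neg(\langle z_1, z_2\rangle \in X^3_{P_2}) \vee \neg(z_1 \in X^1_{C_1}) \vee \neg(z_2 \in X^1_{t_1})) \vee \langle z_1, z_2\rangle \in X^3_{P_1}))$,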



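\[ \theta(t_1 \equiv t_2) := (\forall z)((\neg(z \in X^1_{t_1}) \vee z \in X^1_{t_2}) \wedge (\neg(z \in X^1_{t_2}) \vee z \in X^1_{t_1})), \]

\[ \theta(t_1 \equiv \neg t_2) := (\forall z)((\neg(z \in X^1_{t_1}) \vee \neg(z \in X^1_{t_2})) \wedge (z \in X^1_{t_2} \vee z \in X^1_{t_1})), \]

\[ \theta(t_1 \equiv t_2 \sqcup t_3) := (\forall z)((\neg(z \in X^1_{t_1}) \vee (z \in X^1_{t_2} \vee z \in X^1_{t_3})) \wedge ((\neg(z \in X^1_{t_2}) \vee z \in X^1_{t_1}) \wedge (\neg(z \in X^1_{t_3}) \vee z \in X^1_{t_1}))), \]

\[ \theta(t_1 \equiv t_2 \sqcap t_3) := (\forall z)((\neg(z \in X^1_{t_1}) \vee (z \in X^1_{t_2} \wedge z \in X^1_{t_3})) \wedge ((\neg(z \in X^1_{t_2}) \vee \neg(z \in X^1_{t_3})) \vee z \in X^1_{t_1})), \]

\[ \theta(t_1 \equiv \{e_d\}) := (\forall z)((\neg(z \in X^1_{t_1}) \vee z = x_{e_d}) \wedge (\neg(z = x_{e_d}) \vee z \in X^1_{t_1})), \]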

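\[ \theta(a : C_1) := x_a \in X^1_{C_1}, \]

\[ \theta((a,b) : R_1) := \langle x_a,x_b\rangle \in X^3_{R_1}, \qquad \theta((a,b) : \neg R_1) := \neg(\langle x_a,x_b\rangle \in X^3_{R_1}), \]

\[ \theta(a = b) := x_a = x_b, \qquad \theta(a \neq b) := \neg(x_a = x_b), \]

\[ \theta(e_d : t_1) := x_{e_d} \in X^1_{t_1}, \]

\[ \theta((a,e_d) : P_1) := \langle x_a,x_{e_d}\rangle \in X^3_{P_1}, \qquad \theta((a,e_d) : \neg P_1) := \neg(\langle x_a,x_{e_d}\rangle \in X^3_{P_1}), \]

\[ \theta(\alpha \wedge \beta) := \theta(\alpha) \wedge \theta(\beta). \]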

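The mapping $\theta$ for $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-conjunctive queries is defined as follows.

\[ \theta(R_1(w_1,w_2)) := \langle x_{w_1},x_{w_2}\rangle \in X^3_{R_1}, \]

\[ \theta(P_1(w_1,u_1)) := \langle x_{w_1},x_{u_1}\rangle \in X^3_{P_1}, \]

\[ \theta(C_1(w_1)) := x_{w_1} \in X^1_{C_1}, \]

\[ \theta(w_1 = w_2) := x_{w_1} = x_{w_2}, \]

\[ \theta(u_1 = u_2) := x_{u_1} = x_{u_2}. \]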
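The query part of the mapping is purely syntactic, so it can be illustrated with a few lines of Python. The sketch below is only an illustration under our own assumptions: the tuple encoding of atoms and literals and the names `theta_atom` and `theta_query` are hypothetical and do not come from the paper.

```python
def theta_atom(atom):
    """Map one query atom to a 4LQS^R literal, encoded as a tuple (hypothetical encoding)."""
    kind = atom[0]
    if kind == "concept":            # C(w)  ->  x_w in X^1_C
        _, name, w = atom
        return ("in", f"x_{w}", f"X1_{name}")
    if kind == "role":               # R(w1, w2) or P(w, u)  ->  <x_w1, x_w2> in X^3_R
        _, name, first, second = atom
        return ("in", (f"x_{first}", f"x_{second}"), f"X3_{name}")
    if kind == "eq":                 # w1 = w2 or u1 = u2  ->  x_w1 = x_w2
        _, left, right = atom
        return ("eq", f"x_{left}", f"x_{right}")
    raise ValueError(f"unknown atom kind: {kind!r}")


def theta_query(atoms):
    """Translate a conjunctive query (a list of atoms) conjunct by conjunct."""
    return [theta_atom(a) for a in atoms]


query = [("concept", "C1", "w1"), ("role", "R1", "w1", "w2"), ("eq", "u1", "u2")]
print(theta_query(query))
```

Each query atom yields exactly one level-0 literal, which is why the size of the translated query is linear in the size of the original one.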


To complete the definition, we extend the mapping $\theta$ to substitutions
$\sigma := \{x_1/o_1, \ldots, x_n/o_n\}$, where $x_1, \ldots, x_n \in \mathsf{V}$ and
$o_1, \ldots, o_n \in \mathsf{Ind} \cup \bigcup\{N_C(d) : d \in N_{\mathbf{D}}\}$.

We put $\theta(\sigma) = \theta(\{x_1/o_1, \ldots, x_n/o_n\}) = \{x_{x_1}/x_{o_1}, \ldots, x_{x_n}/x_{o_n}\} = \sigma'$, where
$x_{x_1}, \ldots, x_{x_n}, x_{o_1}, \ldots, x_{o_n}$ are variables of level 0 in $4LQS^R$.

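Let $\mathcal{KB}$ be our $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-knowledge base, and let $\mathsf{cpt}_{\mathcal{KB}}$, $\mathsf{arl}_{\mathcal{KB}}$, $\mathsf{crl}_{\mathcal{KB}}$, and $\mathsf{ind}_{\mathcal{KB}}$
be, respectively, the sets of concept, abstract role, concrete role, and
individual names in $\mathcal{KB}$. Moreover, let $N_{\mathbf{D}}^{\mathcal{KB}} \subseteq N_{\mathbf{D}}$ be the set of datatypes in
$\mathcal{KB}$, and let $N_F^{\mathcal{KB}}$ be a restriction of $N_F$ assigning to every $d \in N_{\mathbf{D}}^{\mathcal{KB}}$ the set $N_F^{\mathcal{KB}}(d)$ of
facets in $N_F(d)$ and in $\mathcal{KB}$. Analogously, let $N_C^{\mathcal{KB}}$ be a restriction of the function
$N_C$ associating to every $d \in N_{\mathbf{D}}^{\mathcal{KB}}$ the set $N_C^{\mathcal{KB}}(d)$ of constants contained in
$N_C(d)$ and in $\mathcal{KB}$. Finally, for every datatype $d \in N_{\mathbf{D}}^{\mathcal{KB}}$, let $\mathsf{bf}_{\mathbf{D}}^{\mathcal{KB}}(d)$ be the set
of facet expressions for $d$ occurring in $\mathcal{KB}$ and not in $N_F(d) \cup \{\top_d, \bot_d\}$. We
assume without loss of generality that the facet expressions in $\mathsf{bf}_{\mathbf{D}}^{\mathcal{KB}}(d)$ are in
Conjunctive Normal Form. We define the $4LQS^R$-formula $\phi_{\mathcal{KB}}$ expressing the
consistency of $\mathcal{KB}$ as follows:

\[ \phi_{\mathcal{KB}} := \bigwedge_{H \in \mathcal{KB}} \theta(H) \wedge \bigwedge_{i=1}^{12} \xi_i, \]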

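where

\[ \xi_1 := (\forall z)((\neg(z \in X^1_{\mathbf{I}}) \vee \neg(z \in X^1_{\mathbf{D}})) \wedge (z \in X^1_{\mathbf{D}} \vee z \in X^1_{\mathbf{I}})) \wedge (\forall z)(z \in X^1_{\mathbf{I}} \vee z \in X^1_{\mathbf{D}}) \wedge \neg(\forall z)\neg(z \in X^1_{\mathbf{I}}) \wedge \neg(\forall z)\neg(z \in X^1_{\mathbf{D}}), \]

\[ \xi_2 := ((\forall z)((\neg(z \in X^1_{\mathbf{I}}) \vee z \in X^1_{\top}) \wedge (\neg(z \in X^1_{\top}) \vee z \in X^1_{\mathbf{I}})) \wedge (\forall z)\neg(z \in X^1_{\bot})), \]

\[ \xi_3 := \bigwedge_{A \in \mathsf{cpt}_{\mathcal{KB}}} (\forall z)(\neg(z \in X^1_A) \vee z \in X^1_{\mathbf{I}}), \]

\[ \xi_4 := \Big(\bigwedge_{d \in N_{\mathbf{D}}^{\mathcal{KB}}} ((\forall z)(\neg(z \in X^1_d) \vee z \in X^1_{\mathbf{D}}) \wedge \neg(\forall z)\neg(z \in X^1_d)) \wedge (\forall z)\Big(\bigwedge_{(d_i,d_j \in N_{\mathbf{D}}^{\mathcal{KB}},\, i<j)} ((\neg(z \in X^1_{d_i}) \vee \neg(z \in X^1_{d_j})) \wedge (z \in X^1_{d_j} \vee z \in X^1_{d_i}))\Big)\Big), \]

\[ \xi_5 := \bigwedge_{d \in N_{\mathbf{D}}^{\mathcal{KB}}} ((\forall z)((\neg(z \in X^1_d) \vee z \in X^1_{\top_d}) \wedge (\neg(z \in X^1_{\top_d}) \vee z \in X^1_d)) \wedge (\forall z)\neg(z \in X^1_{\bot_d})), \]

\[ \xi_6 := \bigwedge_{f_d \in N_F^{\mathcal{KB}}(d),\ d \in N_{\mathbf{D}}^{\mathcal{KB}}} (\forall z)(\neg(z \in X^1_{f_d}) \vee z \in X^1_d), \]

\[ \xi_7 := (\forall z_1)(\forall z_2)((\neg(z_1 \in X^1_{\mathbf{I}}) \vee \neg(z_2 \in X^1_{\mathbf{I}}) \vee \langle z_1,z_2\rangle \in X^3_U) \wedge ((\neg(\langle z_1,z_2\rangle \in X^3_U) \vee z_1 \in X^1_{\mathbf{I}}) \wedge (\neg(\langle z_1,z_2\rangle \in X^3_U) \vee z_2 \in X^1_{\mathbf{I}}))), \]

\[ \xi_8 := \bigwedge_{R \in \mathsf{arl}_{\mathcal{KB}}} (\forall z_1)(\forall z_2)((\neg(\langle z_1,z_2\rangle \in X^3_R) \vee z_1 \in X^1_{\mathbf{I}}) \wedge (\neg(\langle z_1,z_2\rangle \in X^3_R) \vee z_2 \in X^1_{\mathbf{I}})), \]

\[ \xi_9 := \bigwedge_{T \in \mathsf{crl}_{\mathcal{KB}}} (\forall z_1)(\forall z_2)((\neg(\langle z_1,z_2\rangle \in X^3_T) \vee z_1 \in X^1_{\mathbf{I}}) \wedge (\neg(\langle z_1,z_2\rangle \in X^3_T) \vee z_2 \in X^1_{\mathbf{D}})), \]

\[ \xi_{10} := \bigwedge_{a \in \mathsf{ind}_{\mathcal{KB}}} (x_a \in X^1_{\mathbf{I}}) \wedge \bigwedge_{d \in N_{\mathbf{D}}^{\mathcal{KB}},\ e_d \in N_C^{\mathcal{KB}}(d)} x_{e_d} \in X^1_d, \]

\[ \xi_{11} := \bigwedge_{\{e_{d_1},\ldots,e_{d_n}\} \text{ in } \mathcal{KB}} (\forall z)\Big(\Big(\neg(z \in X^1_{\{e_{d_1},\ldots,e_{d_n}\}}) \vee \bigvee_{i=1}^{n} (z = x_{e_{d_i}})\Big) \wedge \Big(\bigwedge_{i=1}^{n} (\neg(z = x_{e_{d_i}}) \vee z \in X^1_{\{e_{d_1},\ldots,e_{d_n}\}})\Big)\Big) \wedge \bigwedge_{\{a_1,\ldots,a_n\} \text{ in } \mathcal{KB}} (\forall z)\Big(\Big(\neg(z \in X^1_{\{a_1,\ldots,a_n\}}) \vee \bigvee_{i=1}^{n} (z = x_{a_i})\Big) \wedge \Big(\bigwedge_{i=1}^{n} (\neg(z = x_{a_i}) \vee z \in X^1_{\{a_1,\ldots,a_n\}})\Big)\Big), \]

\[ \xi_{12} := \bigwedge_{d \in N_{\mathbf{D}}^{\mathcal{KB}},\ \psi_d \in \mathsf{bf}_{\mathbf{D}}^{\mathcal{KB}}(d)} (\forall z)((\neg(z \in X^1_{\psi_d}) \vee z \in \zeta(X^1_{\psi_d})) \wedge (\neg(z \in \zeta(X^1_{\psi_d})) \vee z \in X^1_{\psi_d})), \]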



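with $\zeta$ the transformation function from $4LQS^R$-variables of level 1 to $4LQS^R$-formulae
recursively defined, for $d \in N_{\mathbf{D}}^{\mathcal{KB}}$, by

\[ \zeta(X^1_{\psi_d}) := \begin{cases}
X^1_{\psi_d} & \text{if } \psi_d \in N_F^{\mathcal{KB}}(d) \cup \{\top_d, \bot_d\} \\
\neg \zeta(X^1_{\chi_d}) & \text{if } \psi_d = \neg \chi_d \\
\zeta(X^1_{\chi_d}) \wedge \zeta(X^1_{\varphi_d}) & \text{if } \psi_d = \chi_d \wedge \varphi_d \\
\zeta(X^1_{\chi_d}) \vee \zeta(X^1_{\varphi_d}) & \text{if } \psi_d = \chi_d \vee \varphi_d.
\end{cases} \]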
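The recursive transformation of facet expressions follows the structure of the expression itself. A minimal Python sketch, assuming a hypothetical encoding in which a facet expression is either a facet name (a string) or a nested tuple `("not", e)`, `("and", e1, e2)`, `("or", e1, e2)`; the function name `zeta` and the facet names used below are placeholders, not part of the paper:

```python
def zeta(psi):
    """Recursively rewrite a facet expression into a formula over level-1 variables."""
    if isinstance(psi, str):
        # Base case: a facet (or a top/bottom constant) is mapped to its own variable.
        return f"X1_{psi}"
    op = psi[0]
    if op == "not":
        return ("not", zeta(psi[1]))
    if op in ("and", "or"):
        return (op, zeta(psi[1]), zeta(psi[2]))
    raise ValueError(f"unknown connective: {op!r}")


expr = ("and", "minInclusive18", ("not", "maxExclusive65"))
print(zeta(expr))
```

The rewriting visits each connective once, so its cost is linear in the size of the facet expression.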

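In the above formulae, the variable $X^1_{\mathbf{I}}$ denotes the set of individuals $\mathsf{Ind}$, $X^1_d$ a
datatype $d \in N_{\mathbf{D}}^{\mathcal{KB}}$, $X^1_{\mathbf{D}}$ a superset of the union of datatypes in $N_{\mathbf{D}}^{\mathcal{KB}}$, $X^1_{\top_d}$ and
$X^1_{\bot_d}$ the constants $\top_d$ and $\bot_d$, and $X^1_{f_d}$, $X^1_{\psi_d}$ a facet $f_d$ and a facet expression $\psi_d$,
for $d \in N_{\mathbf{D}}^{\mathcal{KB}}$, respectively. In addition, $X^1_A$, $X^3_R$, $X^3_T$ denote a concept name $A$, an
abstract role name $R$, and a concrete role name $T$ occurring in $\mathcal{KB}$, respectively.
Finally, $X^1_{\{e_{d_1},\ldots,e_{d_n}\}}$ denotes a data range $\{e_{d_1}, \ldots, e_{d_n}\}$ occurring in $\mathcal{KB}$, and
$X^1_{\{a_1,\ldots,a_n\}}$ a finite set $\{a_1, \ldots, a_n\}$ of nominals in $\mathcal{KB}$.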

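The constraints $\xi_1, \ldots, \xi_{12}$, slightly different from the constraints $\psi_1, \ldots, \psi_{12}$ defined
in the proof of Theorem 1 in [7], are introduced to guarantee that each model of
$\phi_{\mathcal{KB}}$ can be easily transformed into a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-interpretation. To prove the theorem,
we show that $\Sigma$ is the answer set for $Q$ w.r.t. $\mathcal{KB}$ iff $\Sigma$ is equal to
$\bigcup_{\mathcal{M} \models \phi_{\mathcal{KB}}} \Sigma'_{\mathcal{M}}$, where $\Sigma'_{\mathcal{M}}$ is the collection of substitutions $\sigma'$ such that $\mathcal{M} \models \psi_Q\sigma'$.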

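Preliminarily, we show that if $\mathcal{M}$ is a $4LQS^R$-interpretation such that $\mathcal{M} \models \phi_{\mathcal{KB}}$,
we can construct a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-interpretation $\mathbf{I}_{\mathcal{M}}$ such that $\mathbf{I}_{\mathcal{M}} \models_{\mathbf{D}} \mathcal{KB}$ and,
if $\mathbf{I}$ is a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-interpretation such that $\mathbf{I} \models_{\mathbf{D}} \mathcal{KB}$, we can construct a $4LQS^R$-interpretation
$\mathcal{M}_{\mathbf{I},\mathbf{D}}$ such that $\mathcal{M}_{\mathbf{I},\mathbf{D}} \models \phi_{\mathcal{KB}}$. Thus, let $\mathcal{M}$ be any $4LQS^R$-interpretation
such that $\mathcal{M} \models \phi_{\mathcal{KB}}$. Reasoning as in [7], it is not hard to see that such $\mathcal{M}$
is a $4LQS^R$-interpretation of the form $\mathcal{M} = (D_1 \cup D_2, M)$, where

- $D_1$ and $D_2$ are disjoint nonempty sets and $\bigcup_{d \in N_{\mathbf{D}}^{\mathcal{KB}}} d^{\mathbf{D}} \subseteq D_2$,

- $M X^1_{\mathbf{I}} := D_1$, $M X^1_{\mathbf{D}} := D_2$, $M X^1_d := d^{\mathbf{D}}$, for every $d \in N_{\mathbf{D}}^{\mathcal{KB}}$,

- $M X^1_{f_d} := f_d^{\mathbf{D}}$, for every $f_d \in N_F^{\mathcal{KB}}(d)$, with $d \in N_{\mathbf{D}}^{\mathcal{KB}}$.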

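Exploiting the fact that $\mathcal{M}$ satisfies the constraints $\xi_1, \ldots, \xi_{12}$, it is then possible
to define a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-interpretation $\mathbf{I}_{\mathcal{M}} = (\Delta^{\mathbf{I}}, \Delta_{\mathbf{D}}, \cdot^{\mathbf{I}})$, by putting

- $\Delta^{\mathbf{I}} := M X^1_{\mathbf{I}}$,

- $\Delta_{\mathbf{D}} := M X^1_{\mathbf{D}}$,

- $A^{\mathbf{I}} := M X^1_A$, for every concept name $A \in \mathsf{cpt}_{\mathcal{KB}}$,

- $S^{\mathbf{I}} := \{\langle u_1,u_2\rangle : u_1 \in M X^1_{\mathbf{I}}, u_2 \in M X^1_{\mathbf{I}}, \langle u_1,u_2\rangle \in M X^3_S\}$, for every abstract
role name $S \in \mathsf{arl}_{\mathcal{KB}}$,

- $T^{\mathbf{I}} := \{\langle u_1,u_2\rangle : u_1 \in M X^1_{\mathbf{I}}, u_2 \in M X^1_{\mathbf{D}}, \langle u_1,u_2\rangle \in M X^3_T\}$, for every concrete
role name $T \in \mathsf{crl}_{\mathcal{KB}}$,

- $a^{\mathbf{I}} := M x_a$, for every individual $a \in \mathsf{ind}_{\mathcal{KB}}$,

- $e_d^{\mathbf{D}} := M x_{e_d}$, for every constant $e_d \in N_C^{\mathcal{KB}}(d)$ with $d \in N_{\mathbf{D}}^{\mathcal{KB}}$.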


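Since $\mathcal{M} \models \bigwedge_{H \in \mathcal{KB}} \theta(H) \wedge \bigwedge_{i=1}^{12} \xi_i$ and, as can easily be checked, $\mathbf{I}_{\mathcal{M}} \models_{\mathbf{D}} H$ iff
$\mathcal{M} \models \theta(H)$, for every statement $H \in \mathcal{KB}$, we plainly have that $\mathbf{I}_{\mathcal{M}} \models_{\mathbf{D}} \mathcal{KB}$. Conversely,
let $\mathbf{I} = (\Delta^{\mathbf{I}}, \Delta_{\mathbf{D}}, \cdot^{\mathbf{I}})$ be a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-interpretation such that $\mathbf{I} \models_{\mathbf{D}} \mathcal{KB}$. We show
how to construct, out of the datatype map $\mathbf{D}$ and the $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-interpretation $\mathbf{I}$,
a $4LQS^R$-interpretation $\mathcal{M}_{\mathbf{I},\mathbf{D}} = (D_{\mathbf{I},\mathbf{D}}, M_{\mathbf{I},\mathbf{D}})$ which satisfies $\phi_{\mathcal{KB}}$. Let us put
$D_{\mathbf{I},\mathbf{D}} := \Delta^{\mathbf{I}} \cup \Delta_{\mathbf{D}}$ and define $M_{\mathbf{I},\mathbf{D}}$ by putting $M_{\mathbf{I},\mathbf{D}} X^1_{\mathbf{I}} := \Delta^{\mathbf{I}}$, $M_{\mathbf{I},\mathbf{D}} X^1_{\mathbf{D}} := \Delta_{\mathbf{D}}$,
$M_{\mathbf{I},\mathbf{D}} X^3_U := U^{\mathbf{I}}$, $M_{\mathbf{I},\mathbf{D}} X^1_{dr} := dr^{\mathbf{D}}$, for every variable $X^1_{dr}$ in $\phi_{\mathcal{KB}}$ denoting a
data range $dr$ occurring in $\mathcal{KB}$, $M_{\mathbf{I},\mathbf{D}} X^1_A := A^{\mathbf{I}}$, for every $X^1_A$ in $\phi_{\mathcal{KB}}$ denoting
a concept name in $\mathcal{KB}$, and $M_{\mathbf{I},\mathbf{D}} X^3_S := S^{\mathbf{I}}$, for every $X^3_S$ in $\phi_{\mathcal{KB}}$ denoting an
abstract role name in $\mathcal{KB}$. Variables $X^3_T$, denoting concrete role names, and
variables $x_a$, $x_{e_d}$, denoting individuals and datatype constants, respectively, are
interpreted in a similar way. From the definitions of $\mathbf{D}$ and $\mathbf{I}$, it follows easily
that $\mathcal{M}_{\mathbf{I},\mathbf{D}}$ satisfies the formulae $\xi_1, \ldots, \xi_{12}$ and $\theta(H)$, for every statement $H \in \mathcal{KB}$,
and, therefore, that $\mathcal{M}_{\mathbf{I},\mathbf{D}}$ is a model for $\phi_{\mathcal{KB}}$.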

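Now we prove the first part of the theorem. Let us assume that $\Sigma$ is the
answer set for $Q$ w.r.t. $\mathcal{KB}$. We have to show that $\Sigma$ is equal to
$\Sigma' = \bigcup_{\mathcal{M} \models \phi_{\mathcal{KB}}} \Sigma'_{\mathcal{M}}$,
where $\Sigma'_{\mathcal{M}}$ is the collection of all the substitutions $\sigma'$ such that $\mathcal{M} \models \psi_Q\sigma'$.

By contradiction, let us assume that there exists a $\sigma \in \Sigma$ such that $\sigma \notin \Sigma'$,
namely $\mathcal{M} \not\models \psi_Q\sigma$, for every $4LQS^R$-interpretation $\mathcal{M}$ with $\mathcal{M} \models \phi_{\mathcal{KB}}$. Since
$\sigma \in \Sigma$, there is a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-interpretation $\mathbf{I}$ such that $\mathbf{I} \models_{\mathbf{D}} \mathcal{KB}$ and $\mathbf{I} \models_{\mathbf{D}} Q\sigma$. Then,
by the construction above, we can define a $4LQS^R$-interpretation $\mathcal{M}_{\mathbf{I},\mathbf{D}}$ such that
$\mathcal{M}_{\mathbf{I},\mathbf{D}} \models \phi_{\mathcal{KB}}$ and $\mathcal{M}_{\mathbf{I},\mathbf{D}} \models \psi_Q\sigma$. Absurd.

Conversely, let $\sigma' \in \Sigma'$ and assume by contradiction that $\sigma' \notin \Sigma$. Then,
for all $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-interpretations $\mathbf{I}$ such that $\mathbf{I} \models_{\mathbf{D}} \mathcal{KB}$, it holds that $\mathbf{I} \not\models_{\mathbf{D}} Q\sigma'$. Since
$\sigma' \in \Sigma'$, there is a $4LQS^R$-interpretation $\mathcal{M}$ such that $\mathcal{M} \models \phi_{\mathcal{KB}}$ and $\mathcal{M} \models \psi_Q\sigma'$.
Then, by the construction above, we can define a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-interpretation $\mathbf{I}_{\mathcal{M}}$ such
that $\mathbf{I}_{\mathcal{M}} \models_{\mathbf{D}} \mathcal{KB}$ and $\mathbf{I}_{\mathcal{M}} \models_{\mathbf{D}} Q\sigma'$. Absurd.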

4 A tableau-based procedure 

In this section, we illustrate a KE-tableau based procedure that, given a $4LQS^R$-formula
$\phi_{\mathcal{KB}}$ corresponding to a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-knowledge base and a $4LQS^R$-formula
$\psi_Q$ corresponding to a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-conjunctive query $Q$, yields all the substitutions
$\sigma' = \{x_1/y_1, \ldots, x_n/y_n\}$, with $\{x_1, \ldots, x_n\} = \mathsf{Var}_0(\psi_Q)$ and $\{y_1, \ldots, y_n\} \subseteq
\mathsf{Var}_0(\phi_{\mathcal{KB}})$, belonging to the answer set $\Sigma'$ of $\psi_Q$ w.r.t. $\phi_{\mathcal{KB}}$.
Let $\overline{\phi}_{\mathcal{KB}}$ be the formula obtained from $\phi_{\mathcal{KB}}$ by:

- moving universal quantifiers in $\phi_{\mathcal{KB}}$ as far inwards as possible according to the
rule $(\forall z)(A(z) \wedge B(z)) \leftrightarrow ((\forall z)A(z) \wedge (\forall z)B(z))$,

- renaming universally quantified variables so as to make them pairwise distinct.

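Let $F_1, \ldots, F_k$ be the conjuncts of $\overline{\phi}_{\mathcal{KB}}$ that are $4LQS^R$-quantifier-free atomic
formulae and $S_1, \ldots, S_m$ the conjuncts of $\overline{\phi}_{\mathcal{KB}}$ that are $4LQS^R$-purely universal
formulae. For every $S_i = (\forall z^i_1) \ldots (\forall z^i_{n_i})\chi_i$, $i = 1, \ldots, m$, we put

\[ Exp(S_i) := \bigwedge_{\{x_{a_1},\ldots,x_{a_{n_i}}\} \subseteq \mathsf{Var}_0(\overline{\phi}_{\mathcal{KB}})} S_i\{z^i_1/x_{a_1}, \ldots, z^i_{n_i}/x_{a_{n_i}}\}. \]

Let $\Phi_{\mathcal{KB}} := \{F_j : j = 1, \ldots, k\} \cup \bigcup_{i=1}^{m} Exp(S_i)$.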
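The expansion step amounts to instantiating the quantified variables of each purely universal conjunct with every tuple of level-0 variables. The following Python sketch illustrates this for a single disjunction; the encoding of a literal as `(positive, symbol, args)` and the names `substitute` and `expand` are assumptions made for the example only.

```python
from itertools import product


def substitute(literal, sub):
    """Apply a substitution (a dict) to the arguments of one literal."""
    positive, symbol, args = literal
    return (positive, symbol, tuple(sub.get(a, a) for a in args))


def expand(bound_vars, clause, level0_vars):
    """Instantiate the quantified variables of a clause in all possible ways.

    bound_vars  -- the universally quantified variables z_1, ..., z_r
    clause      -- the quantifier-free matrix, a tuple of literals (a disjunction)
    level0_vars -- the level-0 variables of the knowledge base formula
    """
    instances = []
    for values in product(sorted(level0_vars), repeat=len(bound_vars)):
        sub = dict(zip(bound_vars, values))
        instances.append(tuple(substitute(lit, sub) for lit in clause))
    return instances


# (forall z)(not(z in X1_A) or z in X1_I), instantiated over {x_a, x_b}
clause = ((False, "X1_A", ("z",)), (True, "X1_I", ("z",)))
for instance in expand(["z"], clause, {"x_a", "x_b"}):
    print(instance)
```

With $k$ level-0 variables and $r$ quantified variables the loop produces $k^r$ instances, which is the count used in the complexity analysis of this section.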

To prepare for the KE-tableau based procedure to be described next, we
introduce some useful notions and notations (see [11] for a detailed overview of
KE-tableau, an optimized variant of semantic tableaux).

Let $\Phi = \{C_1, \ldots, C_p\}$ be a collection of disjunctions of $4LQS^R$-quantifier-free
atomic formulae of level 0 of the types: $x = y$, $x \in X^1$, $\langle x,y\rangle \in X^3$. $\mathcal{T}$ is a
KE-tableau for $\Phi$ if there exists a finite sequence $\mathcal{T}_1, \ldots, \mathcal{T}_t$ such that (i) $\mathcal{T}_1$ is
a one-branch tree consisting of the sequence $C_1, \ldots, C_p$, (ii) $\mathcal{T}_t = \mathcal{T}$, and (iii)
for each $i < t$, $\mathcal{T}_{i+1}$ is obtained from $\mathcal{T}_i$ by an application of one of the rules in
Fig. 1. The set of formulae $\mathcal{S}^{\overline{\beta}}_i = \{\overline{\beta}_1, \ldots, \overline{\beta}_n\} \setminus \{\overline{\beta}_i\}$ occurring as premise in the
E-rule contains the complements of all the components of the formula $\beta$ with
the exception of the component $\beta_i$.

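\[ \frac{\beta_1 \vee \ldots \vee \beta_n \qquad \mathcal{S}^{\overline{\beta}}_i}{\beta_i}\ \textbf{E-Rule} \qquad\qquad \frac{}{A \mid \overline{A}}\ \textbf{PB-Rule} \]

where $\mathcal{S}^{\overline{\beta}}_i := \{\overline{\beta}_1, \ldots, \overline{\beta}_n\} \setminus \{\overline{\beta}_i\}$, for $i = 1, \ldots, n$, and with $A$ a literal

Fig. 1. Expansion rules for the KE-tableau.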

Let $\mathcal{T}$ be a KE-tableau. A branch $\vartheta$ of $\mathcal{T}$ is closed if it contains both $A$ and
$\neg A$, for some formula $A$. Otherwise, the branch is open. A formula $\beta_1 \vee \ldots \vee \beta_n$ is
fulfilled in a branch $\vartheta$ if $\beta_i$ is in $\vartheta$, for some $i = 1, \ldots, n$. A branch $\vartheta$ is complete
if every formula $\beta_1 \vee \ldots \vee \beta_n$ occurring in $\vartheta$ is fulfilled. A KE-tableau is complete
if all its branches are complete.

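Next we introduce the procedure Saturate-KB that takes as input the set
$\Phi_{\mathcal{KB}}$ constructed from a $4LQS^R$-formula $\phi_{\mathcal{KB}}$ representing a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-knowledge
base $\mathcal{KB}$ as shown above, and yields a complete KE-tableau $\mathcal{T}_{\mathcal{KB}}$ for $\Phi_{\mathcal{KB}}$.

procedure 1 Saturate-KB($\Phi_{\mathcal{KB}}$)

1. $\mathcal{T}_{\mathcal{KB}} := \Phi_{\mathcal{KB}}$;

2. Select an open branch $\vartheta$ of $\mathcal{T}_{\mathcal{KB}}$ that is not yet complete.

(a) Select a formula $\beta_1 \vee \ldots \vee \beta_n$ on $\vartheta$ that is not fulfilled.

(b) If $\mathcal{S}^{\overline{\beta}}_j$ is in $\vartheta$, for some $j \in \{1, \ldots, n\}$, apply the E-Rule to $\beta_1 \vee \ldots \vee \beta_n$
and $\mathcal{S}^{\overline{\beta}}_j$ on $\vartheta$ and go to step 2.

(c) If $\mathcal{S}^{\overline{\beta}}_j$ is not in $\vartheta$, for every $j = 1, \ldots, n$, let $B^{\overline{\beta}}$ be the collection of
formulae $\overline{\beta}_1, \ldots, \overline{\beta}_n$ present in $\vartheta$ and let $\overline{\beta}_h$ be the lowest index formula
such that $\overline{\beta}_h \in \{\overline{\beta}_1, \ldots, \overline{\beta}_n\} \setminus B^{\overline{\beta}}$, then apply the PB-Rule to $\overline{\beta}_h$ on $\vartheta$,
and go to step 2.

3. Return $\mathcal{T}_{\mathcal{KB}}$.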
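The saturation loop can be prototyped compactly if branches are represented by their sets of literals. The Python sketch below is a simplified reading of Procedure 1, not the authors' implementation: literals are encoded as `(positive, atom)` pairs, clauses as tuples of literals, and the function returns only the literal sets of the open complete branches rather than the whole tableau. The names `complement`, `is_closed` and `saturate` are hypothetical.

```python
def complement(literal):
    """Return the complementary literal."""
    positive, atom = literal
    return (not positive, atom)


def is_closed(literals):
    """A branch is closed if it contains a literal together with its complement."""
    return any(complement(lit) in literals for lit in literals)


def saturate(clauses, literals=None):
    """Return the literal sets of all open, complete branches for the clauses."""
    if literals is None:
        # The initial one-branch tableau: unit clauses contribute their literal.
        literals = frozenset(c[0] for c in clauses if len(c) == 1)
    if is_closed(literals):
        return []
    for clause in clauses:
        if any(lit in literals for lit in clause):
            continue  # the clause is fulfilled on this branch
        # Components whose complement is not yet on the branch.
        candidates = [lit for lit in clause if complement(lit) not in literals]
        if len(candidates) <= 1:
            # E-Rule: all the other complements are on the branch.
            chosen = candidates[0] if candidates else clause[0]
            return saturate(clauses, literals | {chosen})
        # PB-Rule on the lowest-index complement missing from the branch.
        pivot = candidates[0]
        return (saturate(clauses, literals | {complement(pivot)})
                + saturate(clauses, literals | {pivot}))
    return [literals]  # every clause is fulfilled: the branch is complete


A, B, C, D = (True, "a"), (True, "b"), (True, "c"), (True, "d")
branches = saturate([(A,), (complement(A), B), (C, D)])
for branch in branches:
    print(sorted(branch))
```

Every recursive call adds a literal that was not on the branch before, so the recursion depth is bounded by twice the number of distinct atoms, mirroring the termination argument given for Procedure 1.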

Soundness of Procedure 1 can be easily proved in a standard way and its
completeness can be shown much along the lines of Proposition 36 in [11].
Concerning termination of Procedure 1, our proof is based on the following two facts.
The rules in Fig. 1 are applied only to non-fulfilled formulae on open branches
and tend to reduce the number of non-fulfilled formulae occurring on the considered
branch. In particular, when the E-Rule is applied on a branch $\vartheta$, the number
of non-fulfilled formulae on $\vartheta$ decreases. In case of application of the PB-Rule
on a formula $\beta = \beta_1 \vee \ldots \vee \beta_n$ on a branch, the rule generates two branches. In
one of them the number of non-fulfilled formulae decreases (because $\beta$ becomes
fulfilled). In the other one the number of non-fulfilled formulae stays constant
but the subset $B^{\overline{\beta}}$ of $\{\overline{\beta}_1, \ldots, \overline{\beta}_n\}$ occurring on the branch gains a new element.
Once $|B^{\overline{\beta}}|$ gets equal to $n - 1$, namely after at most $n - 1$ applications of the
PB-Rule, the E-Rule is applied and the formula $\beta = \beta_1 \vee \ldots \vee \beta_n$ becomes fulfilled,
thus decrementing the number of non-fulfilled formulae on the branch. Since the
number of non-fulfilled formulae on each open branch gets equal to zero after a
finite number of steps and the rules of Fig. 1 can be applied only to non-fulfilled
formulae on open branches, the procedure terminates.

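By the completeness of Procedure 1, each open branch $\vartheta$ of $\mathcal{T}_{\mathcal{KB}}$ induces a $4LQS^R$-interpretation
$\mathcal{M}_\vartheta$ such that $\mathcal{M}_\vartheta \models \Phi_{\mathcal{KB}}$. We define $\mathcal{M}_\vartheta = (D_\vartheta, M_\vartheta)$ as follows.
We put

- $D_\vartheta := \{x \in \mathcal{V}_0 : x \text{ occurs in } \vartheta\}$;

- $M_\vartheta x := x$, for every $x \in D_\vartheta$;

- $M_\vartheta X^1_C := \{x : x \in X^1_C \text{ is in } \vartheta\}$, for every $X^1_C \in \mathcal{V}_1$ occurring in $\vartheta$;

- $M_\vartheta X^3_R := \{\langle x,y\rangle : \langle x,y\rangle \in X^3_R \text{ is in } \vartheta\}$, for every $X^3_R \in \mathcal{V}_3$ occurring in $\vartheta$.

It is easy to check that $\mathcal{M}_\vartheta \models \overline{\phi}_{\mathcal{KB}}$ and thus, plainly, that $\mathcal{M}_\vartheta \models \phi_{\mathcal{KB}}$.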
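Reading a model off a branch is a single pass over its literals. The Python sketch below illustrates the construction under an assumed encoding: a literal is a pair `(positive, atom)` and an atom is `("in1", x, X)`, `("in3", (x, y), X)` or `("eq", x, y)`. The function name `model_from_branch` is hypothetical.

```python
def model_from_branch(literals):
    """Build the domain and the interpretation of set variables from a branch."""
    domain, interp = set(), {}
    for positive, atom in literals:
        kind = atom[0]
        if kind == "in1":                    # x in X^1
            _, x, var = atom
            domain.add(x)
            members = interp.setdefault(var, set())
            if positive:
                members.add(x)
        elif kind == "in3":                  # <x, y> in X^3
            _, (x, y), var = atom
            domain.update((x, y))
            members = interp.setdefault(var, set())
            if positive:
                members.add((x, y))
        elif kind == "eq":                   # x = y
            _, x, y = atom
            domain.update((x, y))
    return domain, interp


branch = {
    (True, ("in1", "x_a", "X1_C")),
    (False, ("in1", "x_b", "X1_C")),
    (True, ("in3", ("x_a", "x_b"), "X3_R")),
}
domain, interp = model_from_branch(branch)
print(sorted(domain), interp)
```

Only positive membership literals contribute elements; a variable that occurs only negatively on the branch is interpreted as the empty set, and level-0 variables are interpreted as themselves.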
Next, we provide some complexity results. Let $r$ be the maximum number of
universal quantifiers in $S_i$, for $i = 1, \ldots, m$, and $k := |\mathsf{Var}_0(\overline{\phi}_{\mathcal{KB}})|$. Then, each $S_i$ generates at most $k^r$
expansions. Since the knowledge base contains $m$ such formulae, the number of
disjunctions in the initial branch of the KE-tableau is at most $m \cdot k^r$. Next, let $\ell$ be the
maximum number of literals in $S_i$, for $i = 1, \ldots, m$. Then, the maximum depth
of the KE-tableau, namely the maximum size of the models of $\Phi_{\mathcal{KB}}$ constructed
as illustrated above, is $\mathcal{O}(\ell m k^r)$ and the number of leaves of the tableau, that
is the number of such models of $\Phi_{\mathcal{KB}}$, is $\mathcal{O}(2^{\ell m k^r})$.
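To give a feeling for these bounds, consider a purely illustrative instance (the numbers are hypothetical and not taken from any knowledge base discussed here) with $k = 4$ level-0 variables, $r = 3$, $m = 5$ purely universal conjuncts and $\ell = 3$ literals per conjunct:

\[ k^r = 4^3 = 64, \qquad m \cdot k^r = 5 \cdot 64 = 320, \qquad \ell \cdot m \cdot k^r = 3 \cdot 320 = 960. \]

The initial branch then contains at most 320 disjunctions, the depth of the tableau is bounded by a quantity proportional to 960, and the number of leaves is bounded by a quantity proportional to $2^{960}$.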

We now describe a procedure that, given a KE-tableau constructed by Procedure 1
and a $4LQS^R$-formula $\psi_Q$ representing a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-conjunctive query $Q$,
yields all the substitutions $\sigma'$ in the answer set $\Sigma'$ of $\psi_Q$ w.r.t. $\phi_{\mathcal{KB}}$. By the
soundness of Procedure 1, we can limit ourselves to consider only the models
$\mathcal{M}_\vartheta$ of $\phi_{\mathcal{KB}}$ induced by each open branch $\vartheta$ of $\mathcal{T}_{\mathcal{KB}}$. For every open and complete
branch $\vartheta$ of $\mathcal{T}_{\mathcal{KB}}$, we construct a decision tree $\mathcal{D}_\vartheta$ such that every maximal
branch of $\mathcal{D}_\vartheta$ defines a substitution $\sigma'$ such that $\mathcal{M}_\vartheta \models \psi_Q\sigma'$.

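Let $d$ be the number of literals in $\psi_Q$. $\mathcal{D}_\vartheta$ is a finite labelled tree of depth
$d + 1$ whose labelling satisfies the following conditions, for $i = 0, \ldots, d$:

(i) every node of $\mathcal{D}_\vartheta$ at level $i$ is labelled with $(\sigma'_i, \psi_Q\sigma'_i)$ and, in particular, the
root is labelled with $(\sigma'_0, \psi_Q\sigma'_0)$, where $\sigma'_0$ is the empty substitution;

(ii) if a node at level $i$ is labelled with $(\sigma'_i, \psi_Q\sigma'_i)$, then its $s$ successors, with
$s > 0$, are labelled with

\[ (\sigma'_i\rho_1^{q_{i+1}}, \psi_Q(\sigma'_i\rho_1^{q_{i+1}})), \ldots, (\sigma'_i\rho_s^{q_{i+1}}, \psi_Q(\sigma'_i\rho_s^{q_{i+1}})), \]

where $q_{i+1}$ is the $(i + 1)$-st conjunct of $\psi_Q\sigma'_i$ and $\mathcal{S}^{q_{i+1}} = \{\rho_1^{q_{i+1}}, \ldots, \rho_s^{q_{i+1}}\}$ is
the collection of the substitutions $\rho = \{x_1/y_1, \ldots, x_j/y_j\}$ with $\{x_1, \ldots, x_j\} =
\mathsf{Var}_0(q_{i+1})$ such that $p = q_{i+1}\rho$, for some literal $p$ on $\vartheta$. If $s = 0$, the node
labelled with $(\sigma'_i, \psi_Q\sigma'_i)$ is a leaf node and, if $i = d$, $\sigma'_i$ is added to $\Sigma'$.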
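The decision tree need not be materialised: a depth-first walk that matches the query conjuncts one at a time against the positive literals of a branch visits exactly its nodes. The Python sketch below illustrates this idea under assumptions of our own: atoms are encoded as `(symbol, args)` pairs, `facts` is the set of positive literals of one open complete branch, and `match` and `answers` are hypothetical names.

```python
def match(query_atom, fact, free_vars):
    """Return rho such that query_atom with rho applied equals fact, or None."""
    (q_symbol, q_args), (f_symbol, f_args) = query_atom, fact
    if q_symbol != f_symbol or len(q_args) != len(f_args):
        return None
    rho = {}
    for q_arg, f_arg in zip(q_args, f_args):
        if q_arg in free_vars:
            if rho.setdefault(q_arg, f_arg) != f_arg:
                return None      # the same variable would need two values
        elif q_arg != f_arg:
            return None          # already instantiated argument does not agree
    return rho


def answers(query, facts, query_vars):
    """Collect the substitutions that map every query conjunct onto a fact."""
    results = []

    def walk(level, sigma):
        if level == len(query):
            results.append(dict(sigma))      # a maximal branch of full depth
            return
        symbol, args = query[level]
        # The next conjunct, with the substitution built so far applied.
        conjunct = (symbol, tuple(sigma.get(a, a) for a in args))
        free_vars = query_vars - set(sigma)
        for fact in facts:
            rho = match(conjunct, fact, free_vars)
            if rho is not None:
                walk(level + 1, {**sigma, **rho})

    walk(0, {})
    return results


facts = {
    ("X1_C", ("x_a",)),
    ("X1_C", ("x_b",)),
    ("X3_R", ("x_a", "x_b")),
}
query = [("X1_C", ("x_w1",)), ("X3_R", ("x_w1", "x_w2"))]
print(answers(query, facts, {"x_w1", "x_w2"}))
```

In this example the candidate binding of `x_w1` to `x_b` is abandoned at the second level because no literal of the form $\langle x_b, y\rangle \in X^3_R$ occurs among the facts, which corresponds to a leaf of the decision tree at a level smaller than $d$.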


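Let $\delta(\mathcal{T}_{\mathcal{KB}})$ and $\lambda(\mathcal{T}_{\mathcal{KB}})$ be, respectively, the maximum depth of $\mathcal{T}_{\mathcal{KB}}$ and
the number of leaves of $\mathcal{T}_{\mathcal{KB}}$ computed above. Plainly, $\delta(\mathcal{T}_{\mathcal{KB}}) = \mathcal{O}(\ell m k^r)$ and
$\lambda(\mathcal{T}_{\mathcal{KB}}) = \mathcal{O}(2^{\ell m k^r})$. It is easy to verify that $s = 2^k$ is the maximum branching
of $\mathcal{D}_\vartheta$. Since $\mathcal{D}_\vartheta$ is an $s$-ary tree of depth $d + 1$, where $d$ is the number of
literals in $\psi_Q$, and the $s$ successors of a node are computed in $\mathcal{O}(\delta(\mathcal{T}_{\mathcal{KB}}))$ time,
the number of leaves in $\mathcal{D}_\vartheta$ is $\mathcal{O}(s^{(d+1)}) = \mathcal{O}(2^{k(d+1)})$ and they are computed in
$\mathcal{O}(2^{k(d+1)}\delta(\mathcal{T}_{\mathcal{KB}}))$ time. Finally, since we have $\lambda(\mathcal{T}_{\mathcal{KB}})$ of such decision trees, the
answer set of $\psi_Q$ w.r.t. $\phi_{\mathcal{KB}}$ is computed in time

\[ \mathcal{O}(2^{k(d+1)}\delta(\mathcal{T}_{\mathcal{KB}})\lambda(\mathcal{T}_{\mathcal{KB}})) = \mathcal{O}(2^{k(d+1)} \cdot \ell m k^r \cdot 2^{\ell m k^r}) = \mathcal{O}(\ell m k^r 2^{k(d+1) + \ell m k^r}). \]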

Since the sizes of $\phi_{\mathcal{KB}}$ and of $\psi_Q$ are polynomially related to those of $\mathcal{KB}$ and
of $Q$, respectively (see [5] for details on the reduction), the construction of the
answer set of $Q$ with respect to $\mathcal{KB}$ can be done in double-exponential time. In
case $\mathcal{KB}$ contains neither role chain axioms nor qualified cardinality restrictions, the
complexity of our CQA problem is in EXPTIME, since the maximum number
of universal quantifiers in $\phi_{\mathcal{KB}}$, namely $r$, is a constant (in particular $r = 3$). We
remark that this result is comparable with the complexity of the CQA problem
for a large family of description logics such as $\mathcal{SHIQ}$ [22]. In particular, the
CQA problem for the very expressive description logic $\mathcal{SROIQ}$ turns out to be
2-NEXPTIME-complete.


5 Conclusions 

We have introduced the description logic $\mathcal{DL}\langle 4LQS^{R,\times}\rangle(\mathbf{D})$ ($\mathcal{DL}_{\mathbf{D}}^{4,\times}$, for short)
that extends the logic $\mathcal{DL}\langle 4LQS^R\rangle(\mathbf{D})$ with Boolean operations on concrete roles
and with the product of concepts. We addressed the problem of Conjunctive Query
Answering for the description logic $\mathcal{DL}_{\mathbf{D}}^{4,\times}$ by formalizing $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-knowledge bases
and $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-conjunctive queries in terms of formulae of $4LQS^R$. Such formalization
seems to be promising for implementation purposes.

In our approach, we first constructed a KE-tableau $\mathcal{T}_{\mathcal{KB}}$ for $\phi_{\mathcal{KB}}$, a $4LQS^R$-formalization
of a given $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-knowledge base $\mathcal{KB}$, whose branches induce the
models of $\phi_{\mathcal{KB}}$. Then we computed the answer set of a $4LQS^R$-formula $\psi_Q$,
representing a $\mathcal{DL}_{\mathbf{D}}^{4,\times}$-conjunctive query $Q$, with respect to $\phi_{\mathcal{KB}}$ by means of a forest
of decision trees based on the branches of $\mathcal{T}_{\mathcal{KB}}$ and gave some complexity results.



We plan to generalize our procedure with a data-type checker in order to
extend reasoning with data-types, and also to extend $4LQS^R$ with data-type
groups. We also intend to improve the efficiency of the knowledge base saturation
algorithm and of the query answering algorithm, and to extend the expressiveness of the
queries. Finally, we intend to study a parallel model of the procedure described
and to provide an implementation of it.
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